Novel method for targeted delivery of nucleic acids

ABSTRACT

The present invention is directed to a method of in vivo and ex vivo gene delivery, for a variety of cells. More specifically, it relates to a novel carrier system and method for targeted delivery of nucleic acids to mammalian cells. More specifically, the present invention relates to carrier system comprising single-chain polypeptide binding molecules having an a region rich in basic amino acid and having the three dimensional folding and, thus, the binding ability and specificity, of the variable region of an antibody. The basic amino acid rich region can comprise oligo-lysine, oligo-arginine or combinations thereof. Such preparations of modified single chain polypeptide binding molecules also have ability to bind nucleic acids at the region rich in basic amino acid residues. These properties of the modified single chain polypeptide binding molecules make them very useful in a variety of therapeutic applications including gene therapy. The invention also relates to multivalent antigen-binding molecules having regions rich in basic amino acids. Compositions of, genetic constructions for, methods of use, and methods for producing basic amino acid tailed antigen-binding proteins are disclosed.

[0001] The present application is a divisional application of U.S. Appl.Ser. No. 09/420,592, filed Oct. 19, 1999, which claims benefit of thefiling date of U.S. Appl. No. 60/104,949, filed Oct. 20, 1998, each ofwhich disclosure is incorporated herein in entirety by reference.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention is directed to a method of in vivo and exvivo gene delivery, for a variety of cells. More specifically, itrelates to a novel carrier system and method for targeted delivery ofnucleic acids to mammalian cells. More specifically, the presentinvention relates to carrier systems comprising single-chain polypeptidebinding molecules having a basic amino acid rich region, such as anoligo-lysine or an oligo-arginine region, and having the threedimensional folding and, thus, the binding ability and specificity, ofthe variable region of an antibody. Such preparations of modified singlechain polypeptide binding molecules also have ability to bind nucleicacids at the basic amino acid rich region. These properties of themodified single chain polypeptide binding molecules make them veryuseful in a variety of therapeutic applications including gene therapy.The invention also relates to multivalent antigen-binding moleculeshaving basic amino acid rich regions. Compositions of, geneticconstructions for, methods of use, and methods for producing such basicamino acid rich region containing antigen-binding proteins aredisclosed.

[0004] 2. Background Art

[0005] Substantial attention has been given to the promise of genetherapy in recent years. This term has been used to describe a widevariety of methods using recombinant biotechnology techniques to delivera variety of different materials to a cell. Such methods include, forexample, the delivery of a gene, antisense RNA, a cytotoxic agent, etc.,by a vector to a mammalian cell, preferably a human cell either in vivoor ex vivo. Most of the initial work has focused on the use ofretroviral vectors to transform these cells. This focus has resultedfrom the ability of retroviruses to infect cells with high efficiency.

[0006] However, numerous difficulties with retroviruses have beenreported. For example, problems have been encountered in infectingcertain cell types. Retroviruses typically enter cells via receptors andif such receptors are not present on the cell, or not present in largenumbers, then infection is not possible or efficient. These viruses arealso relatively labile in comparison to other viruses. Outbreaks ofwild-type virus from recombinant virus-producing cell lines have alsobeen reported with the vectors themselves causing disease. Moreover,these viruses are only expressed in dividing cells.

[0007] In addition, retroviral-mediated gene transfer methods typicallyresult in stable transformation of the target cells. Although this maybe regarded as advantageous, the stable transformation of a patient'ssomatic cells makes it difficult to reverse the treatment regimen ifundesirable side effects occur. Moreover, there is the concern thatgenetic transformation might lead to malignant transformation of thecell.

[0008] Other methods of delivering genetic material to cells in vivo andex vivo include the use of liposome entrapped DNA. Liposomes are smallmembrane-enclosed spheres that have been formed with the appropriate DNAentrapped within it. However, this system also has inherent problems. Itis difficult to control the size of the liposome and, hence theuniformity of delivery to individual cells. Additionally, it isdifficult to prevent leakage of the contents of the liposomes and aswith other techniques, there has been difficulty in directing cell-typespecificity.

[0009] Antibodies are proteins generated by the immune system to providea specific molecule capable of complexing with an invading molecule,termed an antigen. Natural antibodies have two identical antigen-bindingsites, both of which are specific to a particular antigen. The antibodymolecule “recognizes” the antigen by complexing its antigen-bindingsites with areas of the antigen termed epitopes. The epitopes fit intothe conformational architecture of the antigen-binding sites of theantibody, enabling the antibody to bind to the antigen.

[0010] The antibody molecule is composed of two identical heavy and twoidentical light polypeptide chains, held together by interchaindisulfide bonds. The remainder of this discussion on antibodies willrefer only to one pair of light/heavy chains, as each light/heavy pairis identical. Each individual light and heavy chain folds into regionsof approximately 110 amino acids, assuming a conserved three-dimensionalconformation. The light chain comprises one variable region (V_(L)) andone constant region (C_(L)), while the heavy chain comprises onevariable region (V_(H)) and three constant regions (C_(H)1, C_(H)2 andC_(H)3). Pairs of regions associate to form discrete structures. Inparticular, the light and heavy chain variable regions associate to forman “Fv” area which contains the antigen-binding site. The constantregions are not necessary for antigen binding and in some cases can beseparated from the antibody molecule by proteolysis, yieldingbiologically active (i.e., binding) variable regions composed of half ofa light chain and one quarter of a heavy chain.

[0011] Further, all antibodies of a certain class and their F_(ab)fragments (i.e., fragments composed of V_(L), C_(L), V_(H), and C_(H)1)whose structures have been determined by x-ray crystallography showsimilar variable region structures despite large differences in thesequence of hypervariable segments even when from different animalspecies. The immunoglobulin variable region seems to be tolerant towardsmutations in the antigen-binding loops. Therefore, other than in thehypervariable regions, most of the so-called “variable” regions ofantibodies, which are defined by both heavy and light chains, are, infact, quite constant in their three dimensional arrangement. See forexample, Huber, R., Science 233:702-703 (1986).

[0012] Recent advances in immunobiology, recombinant DNA technology, andcomputer science have allowed the creation of single polypeptide chainmolecules that bind antigen. These single-chain antigen-bindingmolecules (“SCA”) or single-chain variable fragments of antibodies(“sFv”) incorporate a linker polypeptide to bridge the individualvariable regions, V_(L) and V_(H), into a single polypeptide chain. Adescription of the theory and production of single-chain antigen-bindingproteins is found in Ladner et al., U.S. Pat. Nos. 4,946,778, 5,260,203,5,455,030 and 5,518,889. The single-chain antigen-binding proteinsproduced under the process recited in the above U.S. patents havebinding specificity and affinity substantially similar to that of thecorresponding Fab fragment. A computer-assisted method for linker designis described more particularly in Ladner et al., U.S. Pat. Nos.4,704,692 and 4,881,175, and WO 94/12520.

[0013] The in vivo properties of sFv polypeptides are different fromMAbs and antibody fragments. Due to their small size, sFv polypeptidesclear more rapidly from the blood and penetrate more rapidly intotissues (Milenic, D.E. et al., Cancer Research 51:6363-6371 (1991);Colcher et al., J Natl. Cancer Inst. 82:1191 (1990); Yokota et al.,Cancer Research 52:3402 (1992)). Due to lack of constant regions, sFvpolypeptides are not retained in tissues such as the liver and kidneys.Due to the rapid clearance and lack of constant regions, sFvpolypeptides will have low immunogenicity. Thus, sFv polypeptides haveapplications in cancer diagnosis and therapy, where rapid tissuepenetration and clearance, and ease of microbial production areadvantageous.

[0014] A multivalent antigen-binding protein has more than oneantigen-binding site. A multivalent antigen-binding protein comprisestwo or more single-chain protein molecules. Enhanced binding activity,di- and multi-specific binding, and other novel uses of multivalentantigen-binding proteins have been demonstrated. See, Whitlow, M., etal., Protein Engng. 7:1017-1026 (1994); Hoogenboom, H.R., NatureBiotech. 15:125-126 (1997); and WO 93/11161.

[0015] Ladner et al. also discloses the use of the single chain antigenbinding molecules in diagnostics, therapeutics, in vivo and in vitroimaging, purifications, and biosensors. The use of the single chainantigen binding molecules in immobilized form, or in detectably labeledforms is also disclosed, as well as conjugates of the single chainantigen binding molecules with therapeutic agents, such as drugs orspecific toxins, for delivery to a specific site in an animal, such as ahuman patient.

[0016] Whitlow et al. (Methods.: A Companion to Methods in Enzymology2(2):97- 105 (June, 1991)) provide a good review of the art of singlechain antigen binding molecules and describe a process for making them.

[0017] In U.S. Pat. 5,091,513, Huston et al. discloses a family ofsynthetic proteins having affinity for preselected antigens. Thecontents of U.S. Pat. 5,091,513 are incorporated by reference herein.The proteins are characterized by one or more sequences of amino acidsconstituting a region that behaves as a biosynthetic antibody bindingsite (BABS). The sites comprise (1) noncovalently associated ordisulfide bonded synthetic V_(H) and V_(L) regions, (2) V_(H)—V_(L) orV_(L)—V_(H) single chains wherein the V_(H) and V_(L) are attached to apolypeptide linker, or (3) individual V_(H) or V_(L) domains. Thebinding domains comprises complementarity determining regions (CDRs)linked to framework regions (FRs), which can be derived from separateimmunoglobulins.

[0018] U.S. Pat. 5,091,513 also discloses that three subregions (theCDRs) of the variable domain of each of the heavy and light chains ofnative immunoglobulin molecules collectively are responsible for antigenrecognition and binding. These CDRs consist of one of the hypervariableregions or loops and of selected amino acids or amino acid sequencesdisposed in the framework regions that flank that particularhypervariable region. It is said that framework regions from diversespecies are effective in maintaining CDRs from diverse other species inproper conformation so as to achieve true immunochemical bindingproperties in a biosynthetic protein.

[0019] U.S. Pat. No. 5,091,513 includes a description of a chimericpolypeptide that is a single chain composite polypeptide comprising acomplete antibody binding site. This single chain composite polypeptideis described as having a structure patterned after tandem V_(H) andV_(L) domains, with a carboxyl terminal of one attached through an aminoacid sequence to the amino terminal of the other. It thus comprises anamino acid sequence that is homologous to a portion of the variableregion of an immunoglobulin heavy chain (V_(H)) peptide bonded to asecond amino acid sequence that was homologous to a portion of thevariable region of an immunoglobulin light chain (V_(L))

[0020] Chen et al., describe the production and use of a fusion proteinconsisting of an antibody Fab fragment and a DNA binding moiety,protamine, to deliver toxin-expressing plasmid DNA into HIV infectedcells by receptor mediated endocytosis (S-Y Chen et al., Gene Therapy 2:116-123 (1995)).

BRIEF SUMMARY OF THE INVENTION

[0021] Accordingly, it is an object of the present invention to providea new and improved delivery system that can introduce foreign genes in anon-toxic, cell specific manner into mammalian cells. Also provided bythe invention is a system and an efficient method that exhibits a highdegree of cell specificity using relatively simple yet reliabledelivery.

[0022] Another feature of the present invention is the use ofreceptor-mediated specificity to provide cell specificity to the genedelivery system. This involves the use of cell-surface receptors asnaturally existing entry mechanisms for the specific delivery of genes.The molecules once recognized and bound to the receptor can beinternalized within the target cell via endocytosis. Included in thisfeature is the provision for a unique carrier comprising a single-chainantigen-binding protein/polynucleotide complex capable of targeting thegene to specific cells possessing particular receptors that recognizethe complex.

[0023] In addition, the carrier of the present invention relates totailed single chain polypeptides containing a basic amino acid richregion (i.e., oligo-lysine, oligo-arginine, or a mixture thereof) andhaving binding affinity for an antigen and the capability of deliveringnucleic acids to a cell and processes for preparing them. Suitablepolypeptides are, for example, those described by Ladner et al. in U.S.Pat. No. 4,946,778 and Huston et al. in U.S. Pat. No. 5,091,513.

[0024] These features provide advantages to the present invention thatdirectly contribute to the efficiency and target specificity of thedelivery system to specific cell types, including normal cells as wellas tumor cells not found in the delivery systems known in the art.

[0025] The present invention is directed to a method of deliveringnucleic acids to a cell comprising:

[0026] (1) providing an a basic amino acid tailed single-chainantigen-binding polypeptide capable of delivering nucleic acids to acell comprising:

[0027] (a) a first polypeptide comprising the antigen binding portion ofthe variable region of an antibody heavy or light chain;

[0028] (b) a second polypeptide comprising the antigen binding portionof the variable region of an antibody heavy or light chain; and

[0029] (c) a peptide linker linking the first and second polypeptides(a) and (b) into a single chain polypeptide having an antigen bindingsite, wherein, at its C-terminus, N-terminus, or both of polypeptide(a), (b) or both, the single-chain antigen-binding polypeptide has anamount of basic amino acid residues sufficient to bind nucleic acids,wherein the basic amino acid residues are selected from the groupconsisting of: Lys, Arg and a combination thereof; and

[0030] wherein the basic amino acid residues binds nucleic acid andwherein the single-chain antigen-binding polypeptide binds antigen;

[0031] (2) allowing a nucleic acid to bind to the basic amino acidresidue containing single-chain antigen-binding polypeptide; and

[0032] (3) transforming a cell with the nucleic acid bound basic aminoacid residue containing single-chain antigen-binding polypeptide.

[0033] More particularly, the invention is directed to a single-chainantigen-binding polypeptide capable of delivering nucleic acids to acell, comprising:

[0034] (a) a first polypeptide comprising the antigen binding portion ofthe variable region of an antibody heavy or light chain;

[0035] (b) a second polypeptide comprising the antigen binding portionof the variable region of an antibody heavy or light chain; and

[0036] (c) a peptide linker linking the first and second polypeptides(a) and (b) into a single chain polypeptide having an antigen bindingsite,

[0037] wherein at its C-terminus, N-terminus, or both of polypeptide(a), (b) or both, the single-chain antigen-binding polypeptide has anamount of basic amino acid residues sufficient to bind nucleic acids,wherein the basic amino acid residues are selected from the groupconsisting of: Lys, Arg and a combination thereof; and wherein the basicamino acid residues binds nucleic acid and wherein the single-chainantigen-binding polypeptide binds antigen. These basic amino acidresidues in the sFv protein (e.g., oligo-lysine sFv) generate a minimalnon-specific nucleic acid binding region. The basic amino acid region isconfigured such that at least 2 to 8 groups of eight consecutiveresidues of Lys, Arg or a combination thereof are separated fromadjacent groups by 0-20 amino acid residues.

[0038] The invention is further directed to a genetic sequence encodinga single-chain antigen-binding polypeptide capable of delivering nucleicacids to a cell, comprising:

[0039] (a) a first polypeptide comprising the antigen binding portion ofthe variable region of an antibody heavy or light chain;

[0040] (b) a second polypeptide comprising the antigen binding portionof the variable region of an antibody heavy or light chain; and

[0041] (c) a peptide linker linking the first and second polypeptides(a) and (b) into a single chain polypeptide having an antigen bindingsite, wherein at its C-terminus, N-terminus, or both of polypeptide (a),(b) or both, the single-chain antigen-binding polypeptide has an amountof basic amino acid residues sufficient to bind nucleic acids, whereinthe basic amino acid residues are selected from the group consisting of:Lys, Arg and a combination thereof, and

[0042] wherein the basic amino acid residues binds nucleic acid andwherein the single-chain antigen-binding polypeptide binds antigen.These basic amino acid residues in the sFv protein (e.g., oligo-lysinesFv) generate a minimal non-specific nucleic acid binding region. Thebasic amino acid region is configured such that at least 2 to 8 groupsof eight consecutive residues of Lys, Arg or a combination thereof areseparated from adjacent groups by 0-20 amino acid residues.

[0043] The nucleic acid is a polynucleotide that can be either DNA orRNA.

[0044] The invention is directed to a replicable cloning or expressionvehicle comprising the above described polynucleotide sequence. Theinvention is also directed to such vehicle which is a plasmid. Theinvention is further directed to a host cell transformed with the abovedescribed DNA. The host cell can be a bacterial cell, a yeast cell orother fungal cell, an insect cell or a mammalian cell line. A preferredhost is Pichia pastoris.

[0045] The invention is directed to a method of producing a single-chainantigen-binding polypeptide capable of delivering nucleic acids to acell, comprising:

[0046] (a) providing a first genetic sequence encoding a firstpolypeptide comprising the antigen binding portion of the variableregion of an antibody heavy or light chain;

[0047] (b) providing a second genetic sequence encoding a secondpolypeptide comprising the antigen binding portion of the variableregion of an antibody heavy or light chain; and

[0048] (c) linking the first and second genetic sequences (a) and (b)with a third genetic sequence encoding a peptide linker into a fourthgenetic sequence encoding a single chain polypeptide having an antigenbinding site, wherein at its C-terminus, N-terminus, or both ofpolypeptide (a), (b) or both, the single-chain antigen-bindingpolypeptide has an amount of basic amino acid residues sufficient tobind nucleic acids, wherein the basic amino acid residues are selectedfrom the group consisting of: Lys, Arg and a combination thereof; and

[0049] wherein the basic amino acid residues binds nucleic acid andwherein the single-chain antigen-binding polypeptide binds antigen;

[0050] (d) transforming a host cell with the fourth genetic sequenceencoding a single-chain antigen-binding polypeptide of (c); and

[0051] (e) expressing the single-chain antigen-binding polypeptide of(c) in the host, thereby producing a single-chain antigen-bindingpolypeptide capable of delivering nucleic acids to a cell.

[0052] The invention is further directed to a multivalent single-chainantigen-binding protein, comprising two or more single-chainantigen-binding polypeptides, each single-chain antigen-bindingpolypeptide comprising:

[0053] (a) a first polypeptide comprising the antigen binding portion ofthe variable region of an antibody heavy or light chain;

[0054] (b) a second polypeptide comprising the antigen binding portionof the variable region of an antibody heavy or light chain; and

[0055] (c) a peptide linker linking the first and second polypeptides(a) and (b) into a single chain polypeptide having an antigen bindingsite, wherein at its C-terminus, N-terminus, or both of polypeptide (a),(b) or both, the single-chain antigen-binding polypeptide has an amountof basic amino acid residues sufficient to bind nucleic acids, whereinthe basic amino acid residues are selected from the group consisting of.Lys, Arg and a combination thereof; and

[0056] wherein the basic amino acid residues binds nucleic acid andwherein the single-chain antigen-binding polypeptide binds antigen.

[0057] In the above described embodiments of the invention, a lysinerich or an oligo-Lys polypeptide sequence of the present invention canbe capable of attaching a polyalkylene oxide moiety wherein thepolyalkylene oxide conjugated oligo-lysine tailed single-chainantigen-binding polypeptide binds an antigen as well as nucleic acids.

[0058] In the above described embodiments of the invention, theC-terminus of the second polypeptide (b) can be the native C-terminus.The C-terminus of the second polypeptide (b) can comprise a deletion ofone or plurality of amino acid residue(s), such that the remainingN-terminus amino acid residues of the second polypeptide are sufficientfor the polypeptide to be capable of binding an antigen. The C-terminusof the second polypeptide can comprise an addition of one or pluralityof amino acid residue(s), such that the polypeptide is capable ofbinding an antigen. Moreover, the nucleic acid binding region can begenerated by mutating one or a plurality of amino acid residue(s) to abasic amino acid residue(s) in the C-terminal or N-terminal regions ofthe polypeptide (a) or (b). In addition, the nucleic acid binding regioncan be generated by inserting blocks of basic amino acids at theC-terminus or N-terminus of the polypeptide (a) or (b).

[0059] In a preferred embodiment of the invention, the first polypeptide(a) can comprise the antigen binding portion of the variable region ofan antibody light chain and the second polypeptide (b) comprises theantigen binding portion of the variable region of an antibody heavychain.

[0060] The invention is also directed to a method for treating atargeted disease, comprising administering an effective amount of acomposition comprising a nucleic acid molecule bound to the polypeptideor protein of the invention and a pharmaceutically acceptable carriervehicle for delivery to a cell.

BRIEF DESCRIPTION OF THE DRAWINGS/FIGURES

[0061]FIG. 1 shows the DNA (SEQ ID NO: 1) and protein sequence (SEQ IDNO: 2) of CC49/218 sFv with an engineered oligo-lysine C-terminal tailsegment. The eight new lysine residues were genetically engineered at aBstEII site and are shown underlined and marked with asterisks. Alsohighlighted are the CDR sequences (double underlined), the 218 linker(underlined and labeled) and selected restriction sites.

[0062]FIGS. 2A and 2B show the DNA (SEQ ID NO: 3) and protein sequence(SEQ ID NO: 4) of CC49/218 sFv with an engineered oligo-lysineC-terminal tail segment. The sixteen new lysine residues weregenetically engineered at a BstEII site and are shown underlined andmarked with asterisks. Also highlighted are the CDR sequences (doubleunderlined), the 218 linker (underlined and labeled) and selectedrestriction sites.

[0063]FIG. 3 shows the protein sequence (SEQ ID NO: 5) of A33/218 sFvwith engineered oligo-lysine C-terminal tail segment. The sixteen newlysine residues are marked with asterisks. Also highlighted are the CDRsequences (double underline) and the 218 linker (overlined and labeled).

[0064]FIG. 4 shows DNA binding by A33/218 SCA with an engineered 16lysine C-terminal tail using gel shift assay: lane 1 is a BSA control,lane 2 is a GS115 culture supernatant control and lanes 3-12 have 0, 5,10, 15, 20, 30, 40, 50, 60, 70 and 80 μl, respectively of dialyzedculture supernatant of the 16 lysine SCA protein.

[0065]FIG. 5 shows the Coomassie Blue stained SDS-PAGE gel of purifiedCC49-16K 266(7). Lane 1, molecular weight markers; Lane 2, purifiednative CC49/218 sFv; Lane 3, EN266(7) fermentation cell pellet; Lane 4,EN266(7) sFv released from Lane 3 material by a high salt wash (1.5 MNaCl, 20 mM Tris-HCl, pH 8.0, at room temperature for 2 hours).

[0066]FIG. 6 shows an ELISA assay demonstrating retention ofmucin-binding activity of the CC49-16K sFv EN266(7).

[0067]FIGS. 7A and 7B show the results of the transfection of LS174-Tcells by reporter plasmid pSEAP using CC49-16K sFv as carrier.

[0068]FIG. 8 shows the sequence for Kabat Consensus V_(K)I/218/V_(H)IIIsFv (SEQ ID NO: 6). The sixteen new lysine residues are marked withasterisks. CDR sequences are double underlined.

[0069]FIG. 9 shows the sequence for C6.5/218 sFv (SEQ ID NO: 7). Thesixteen new lysine residues are marked with asterisks. CDR sequences aredouble underlined.

DETAILED DESCRIPTION OF THE INVENTION

[0070] The present invention is directed to the novel combination of asingle-chain polypeptide containing basic amino acid region (e.g.,regions rich in basic amino acids, oligo-lysine, oligo-arginine orcombination thereof) and having 1) capability to bind nucleic acids; and2) binding affinity for an antigen, such that the polypeptide is capableof delivering nucleic acids to cells. The present invention is alsodirected to a method of delivering nucleic acids to cells using a basicamino acid tailed single chain polypeptide. Furthermore, the foregoingdesign could applied not only to sFvs but also to V_(H) single domains,disulfied-stabilized Fv, Fabs or Mabs.

[0071] Gene Delivery Methods

[0072] The present invention provides a novel delivery system that canintroduce foreign genes in a non-toxic, cell specific manner intomammalian cells ex vivo or in vivo. Also provided by the invention is asystem and method that exhibits a high degree of cell specificity usingrelatively simple yet reliable delivery.

[0073] The present invention uses a receptor-mediated specificity toprovide cell specificity to the gene delivery system. This involves theuse of cell-surface receptors or antigens as naturally existing entrymechanisms for the specific delivery of genes. Included in this featureis the provision for a unique basic amino acid tailed single-chainantigen-binding polypeptide polynucleotide complex capable of targetingthe gene to specific cells possessing particular receptors or antigensthat are recognized by the complex. Cell specificity can be achieved byselecting a single-chain antigen-binding protein that has a bindingaffinity for the cell type to be targeted for gene delivery. Forexample, anti-tumor single-chain antigen-binding protein can be used totarget gene delivery to specific tumor cells. Also, anti-fluoresceinsingle-chain antigen-binding proteins can be used to target fluoresceinlabeled cells. Thus, the skilled artisan could readily target any celltype by selecting a single-chain antigen-binding protein having anappropriate affinity for the targeted cell.

[0074] In addition, the cell specificity for the targeted delivery ofnucleic acids can be achieved or enhanced by including “translocationdomains” in the sFvs of the present invention. The use of the exotoxin A“translocation domain” has been demonstrated to facilitate efficient DNAtransfer in non-viral DNA delivery systems. See, Fominaya et al. J Biol.Chem. 271: 10560 (1986); and WO 96/13599, incorporated by reference).Also, nucleus targeting peptide fusions have demonstrated enhanceddelivery of DNA to the nucleus in non-viral DNA delivery systems. (See,Avrameas et al. Proc. Natl. Acad. Sci. 95: 5601-5606 (1998)). Thus, theskilled artisan could readily further enhance the efficiency nucleicacid delivery to a target cell type by including “translocation domain”and or a nucleus targeting peptide within a single-chain antigen-bindingprotein which has an appropriate affinity for the targeted cell.

[0075] The present invention is directed to a method of deliveringnucleic acids to a cell comprising:

[0076] (1) providing an basic amino acid tailed single-chainantigen-binding polypeptide capable of delivering nucleic acids to acell comprising:

[0077] (a) a first polypeptide comprising the antigen binding portion ofthe variable region of an antibody heavy or light chain;

[0078] (b) a second polypeptide comprising the antigen binding portionof the variable region of an antibody heavy or light chain; and

[0079] (c) a peptide linker linking the first and second polypeptides(a) and (b) into a single chain polypeptide having an antigen bindingsite, wherein at its C-terminus, N-terminus, or both of polypeptide (a),(b) or both, the single-chain antigen-binding polypeptide has an amountof basic amino acid residues sufficient to bind nucleic acids, whereinthe basic amino acid residues are selected from the group consisting of:Lys, Arg and a combination thereof; and

[0080] wherein the basic amino acid residues binds nucleic acid andwherein the single-chain antigen-binding polypeptide binds antigen;

[0081] (2) allowing a nucleic acid to bind to the basic amino acidtailed single-chain antigen-binding polypeptide; and

[0082] (3) transforming a cell with the nucleic acid bound basic aminoacid tailed single-chain antigen-binding polypeptide.

[0083] The invention also provides for the use of the basic amino acidtailed sFv proteins (i.e., oligo-Lys, oligo-Arg or oligo combination oflys and arg residues; or regions rich in lys and/or arg residues) in aprocess for targeted gene therapy. The invention further provides forsFv proteins having an amount of basic amino acid residues sufficient tobind nucleic acids. More specifically, the invention provides for sFvproteins having at least 10, 12, 14 or 16 lysines in the C-terminalregion of the sFv which bind nucleic acids wherein the lysine residuesare configured in two groups of eight consecutive lysine residuesseparated by 0-20 amino acid residues. A 16 lysine C-terminal tailed SCAprotein complexed to a nucleic acid construct capable of expressing aprotein, can be used to deliver such nucleic acid constructs to aspecific cell type for (1) transient expression of the protein or (2)allow for the DNA construct to be inserted safely into the genome andhave the expression be regulated by normal cellular signals. To targetthe nucleic acid delivery to a specific cell type, an SCA protein isselected that will bind to and be internalized by that cell type. Manysuch SCA proteins are well known to those skilled in the art. Forexample, anti-tumor SCAs can be modified to have a 16 lysine C-terminaltail according to the present invention. These anti-tumor SCAs can beused to carry DNA encoding toxins or chemotherapeutic proteins that,when internalized and expressed, will cause the death of the tumor cell.

[0084] Furthermore, PEGylating the oligo-lysine containing SCA proteinor according to the methods disclosed in U.S. application Ser. No.09/069,842, filed on Apr. 30, 1998 (incorporated by reference in itsentirety), will also provide protection from degradation for thecomplexed nucleic acid since the modified SCA proteins have reducedimmunogenicity and antigenicity as well a longer half-life in thebloodstream. In addition, an SCA protein having one or more lysineresidues in a basic amino acid rich region will allow for site specificPEGylation at the lysine residue(s).

[0085] As indicated above, the single-chain antigen-binding polypeptideshave a nucleic acid binding region comprising a sufficient amount ofbasic amino acids to bind nucleic acids. This region can comprise asequence that is rich in basic amino acids such as lysine, arginine andcombinations thereof. This region will contain enough basic amino acidsto obtain the requisite overall positive charge on the sFv for nucleicacid binding. These nucleic acid binding regions can be at theC-terminal region, N-terminal region or both of the sFv. The nucleicacid binding regions can be generated by mutating one or a plurality ofamino acid residue(s) of the sFv or by adding a block of basic aminoacid residues to the C-terminal region, N-terminal region or both of thesFv. Furthermore, the foregoing design could be applied not only to sFvsbut also to V_(H) single domains, disulfide-stabilized Fv, Fabs or Mabs.

[0086] Preferably, the single-chain antigen-binding polypeptideaccording to the present invention has an amount of oligo-Lys, oligo-Argor oligo-Lys/Arg residues sufficient to bind nucleic acids. Preferably,the nucleic acid binding region of single-chain antigen-bindingpolypeptide comprises at least 2 to 8 groups of eight consecutive Lysresidues, Arg residues or a combination thereof, wherein each group ofeight consecutive lysine, arg or lys/arg residues is separated fromadjacent groups by 0-20 amino acid residues. More preferably, thenucleic acid binding region of the single-chain antigen-bindingpolypeptide comprises at least 2 to 6 groups of eight consecutive Lysresidues, Arg residues or a combination thereof, wherein each group ofeight consecutive lysine, arg or lys/arg residues is separated fromadjacent groups by 0-20 amino acid residues. Still more preferably, thenucleic acid binding region of the single-chain antigen-bindingpolypeptide comprises at least 2 to 4 groups of eight consecutive Lysresidues, Arg residues or a combination thereof, wherein each group ofeight consecutive lysine, arg or lys/arg residues is separated fromadjacent groups by 0-20 amino acid residues. More preferably, thenucleic acid binding region of the single-chain antigen-bindingpolypeptide comprises at least 2 to 3 groups of eight consecutive Lysresidues, Arg residues or a combination thereof, wherein each group ofeight consecutive lysine, arg or lys/arg residues is separated fromadjacent groups by 0-20 amino acid residues. Still more preferably, thenucleic acid binding region of the single-chain antigen-bindingpolypeptide has at least 2 groups of eight consecutive Lys residues, Argresidues or a combination thereof, wherein each group of eightconsecutive lysine, arg or lys/arg residues is separated from adjacentgroups by 0-20 amino acid residues.

[0087] The nucleic acid binding regions of the single-chainantigen-binding polypeptide of the present invention can be representedby the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK) (SEQ ID NO:8), wherein K is lysine, m is an integer between 1 and 7 and n is aninteger between 0 and 20; 2) (RRRRRRRR)m (X)n (RRRRRRRR) (SEQ ID NO: 9),wherein R is Arginine, m is an integer between 1 and 7 and n is aninteger between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is an integer between 1 and 7and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK)(SEQ ID NO: 11), wherein K is lysine, R is arginine, m is an integerbetween 1 and 7 and n is an integer between 0 and 20. Preferably, thenucleic acid binding regions of the single-chain antigen-bindingpolypeptide of the present invention can be represented by the followingformulas:1) (KKKKKKKK)m (X)n (KKKKKKKK) (SEQ ID NO: 8), wherein K islysine, m is an integer between 1 and 5 and n is an integer between 0and 20; 2) (RRRRRRRR)m (X)n (RRRRRRRR) (SEQ ID NO: 9), wherein R isArginine, m is an integer between 1 and 5 and n is an integer between 0and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO: 10), wherein K islysine R is arginine such that the K and R residues either alternate orare in random order, m is an integer between 1 and 5 and n is an integerbetween 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK) (SEQ ID NO: 11),wherein K is lysine, R is arginine, m is an integer between 1 and 5 andn is an integer between 0 and 20.

[0088] More preferably, the nucleic acid binding regions of thesingle-chain antigen-binding polypeptide of the present invention can berepresented by the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK)(SEQ ID NO: 8), wherein K is lysine, m is an integer between 1 and 3 andn is an integer between 0 and 20; 2) (RRRRRRR)m (X)n (RRRRRRRR) (SEQ IDNO: 9), wherein R is Arginine, m is an integer between 1 and 3 and n isan integer between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is an integer between 1 and 3and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK)(SEQ ID NO: 11), wherein K is lysine, R is arginine, m is an integerbetween 1 and 3 and n is an integer between 0 and 20.

[0089] Still more preferably, the nucleic acid binding regions of thesingle-chain antigen-binding polypeptide of the present invention can berepresented by the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK)(SEQ ID NO: 8), wherein K is lysine, m is an integer between 1 and 2 andn is an integer between 0 and 20; 2) (RKRRKRRR)m (X)n (RRRRRRRR) (SEQ IDNO: 9), wherein R is Arginine, m is an integer between 1 and 2 and n isan integer between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is an integer between 1 and 2and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK)(SEQ ID NO: 11), wherein K is lysine, R is arginine, m is an integerbetween 1 and 2 and n is an integer between 0 and 20. More preferably,the DNA binding regions of the single-chain antigen-binding polypeptideof the present invention can be represented by the followingformulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK) (SEQ ID NO: 8), wherein K islysine, in is 1 and n is an integer between 0 and 20; 2) (RRRRRRRR)m(X)n (RRRRRRRR) (SEQ ID NO: 9), wherein R is Arginine, m is 1 and n isan integer between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is I and n is an integerbetween 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK) (SEQ ID NO: 11),wherein K is lysine, R is arginine, m is 1 and n is an integer between 0and 20.

[0090] Even more preferably, the single-chain antigen-bindingpolypeptide has the basic amino acid residue rich region, oligo-lysineresidues, oligo-arginine residues or combination thereof, configuredsuch that the number of groups of Lys and or Arg residues is no higherthan that which would result in an unstable polynucleotide encoding thesingle-chain antigen-binding polypeptide or significantly reduce theefficiency of translation of the polynucleotide encoding thesingle-chain antigen-binding polypeptide or interfere with antigenbinding.

[0091] The preferred ratio of single-chain antigen-binding polypeptideto nucleic acid are as follows: 10,000:1, 2000:1, 1000:1, 500:1, 250:1or 100:1 (molar ratio of sFv: DNA). Of course, these ranges areexemplary only and one of skill in the art could readily optimize theratios for optimal binding and transfection results for any particularcell or tissue type.

[0092] The optimal conditions for complexing the nucleic acids with thesFv of the present invention is as follows. Preferably, the complexingis performed in a complexing buffer containing 0-100 mM Tris-HCl (or anyequivalent buffer), 5-500 mM NaCl, pH 6-9. More preferably, thecomplexing is performed in a buffer containing 10 mM Tris-HCl, 150 mMNaCl, pH 7.5. The time and temperature for complexing the nucleic acidswith the sFv in the foregoing complexing buffer are 1 to 120 minutes at4-40° C. The preferred time and temperature for complexing the nucleicacids with the sFv in the complexing buffer are 15 minutes at roomtemperature or about 22° C.

[0093] Chen et al. described the production and use of a fusion proteinconsisting of an antibody Fab fragment and a DNA binding moiety,protamine, to deliver toxin-expressing plasmid DNA into HIV infectedcells by receptor mediated endocytotsis. S-Y Chen et al., (1995) GeneTherapy 2:116-123. The present invention, however, has advantages overthe Fab-protamine fusion peptide for delivering DNA into cells asdisclosed by Chen et al. This is because, protamine or poly lysinedomains (of a 100 lysine residues or more, i.e., “100K”) are very tightbinders of DNA compared to the “16K” tail of the present invention. Thepresent inventors, however, have discovered that the oligo lysineconfiguration of the present invention having 16 lysines (“16K”) placedwithin a short C-terminal extension from the sFv behaves as a minimalnucleic acid-binding domain. (See, for example, FIGS. 2A, 2B and 3 andExamples 1 and 3). This is because the natural DNA binding protein likeprotamine interacts with DNA presumably by electrostatic interactions,hydrogen bonding, hydrophobic bonds, Van der Waals bonds, and overallshape complementarity. However, in contrast to most natural DNA bindingproteins, the sFv containing a basic amino acid rich region according tothe present invention is proposed to bind and complex with nucleic acidsessentially through electrostatic interactions and interpolyelectrolytecomplex chemistry.

[0094] Moreover, Pardridge et al (J. Pharma. and ExperimetalTherapeutics 286:548-554 (1998)) have shown that “cationization”promotes endocytosis of Mabs. Thus, the sFv having a basic amino acidrich region according to the present invention can be more readily takenup by endocytosis due an increased positive charge of the sFv. Anincrease in endocytosis is expected to result in increased transfectionefficiencies and expression of the nucleic acids that are complexed withthe sFv of the present invention.

[0095] The use of sFv fused to a minimal nucleic acid-binding domain canalso have production advantages. Although, proteins such as protamine orpolylysine having 100 or more lysines can be more effective atcondensing DNA, the expected reduction in affinity of the 16K tail forDNA, relative to protamine, will have the advantage of releasing thenucleic acid more efficiently from the sFv once targeting has beenachieved, thereby allowing this nucleic acid to be expressed by thecell. Thus, the 16K tailed sFv of the present can be a more effectivenucleic acid delivery vehicle than the Fab-protamine or Fab-polylysine(100K) synthetic polypeptides disclosed in the art. The oligo-lysine oroligo-arginine tail strategy of the present invention can be amenable toPEGylation as discussed, supra, which results in a DNA delivery carrierwith reduced immunogenicity and increased half-life.

[0096] As shown in Examples 5 and 6, below, the 16K sFv of the presentinvention can be employed as a targeting molecule to enhancetransfection of specific cells in culture. The demonstration of DNAdelivery to cultured cells by in situ immunochemistry shows that the SCAmolecule of the present invention can accomplish specific targeting,even for targets that are non-internalizing. In addition, transfectionwas also shown to be markedly enhanced by the oligo-lysine sFv of thepresent invention. Since the oligo-lysine sFv of the present inventionhas demonstrated to be successful in transfecting targets that are noninternalizing, it is anticipated that the enhanced specific transfectionof an internalizing target should also be achievable.

[0097] The nucleic acid used in the present invention can have atherapeutic effect on the target cell, the effect selected from, but notlimited to, correcting a defective gene or protein, a drug action, atoxic effect, a growth stimulating effect, a growth inhibiting effect, ametabolic effect, a catabolic affect, an anabolic effect, an antiviraleffect, an antibacterial effect, a hormonal effect, a neurohumoraleffect, a cell differentiation stimulatory effect, a celldifferentiation inhibitory effect, a neuromodulatory effect, anantineoplastic effect, an anti-tumor effect, an insulin stimulating orinhibiting effect, a bone marrow stimulating effect, a pluripotent stemcell stimulating effect, an immune system stimulating effect, and anyother known therapeutic effects that can be provided by a therapeuticagent delivered to a cell via a delivery system according to the presentinvention.

[0098] The sFv conjugate of the present invention can be used forprotection, suppression or treatment of infection or disease. By theterm “protection” from infection or disease as used herein is intended“prevention,” “suppression” or “treatment.” “Prevention” involvesadministration of a sFv conjugate prior to the induction of the disease.“Suppression” involves administration of the composition prior to theclinical appearance of the disease.

[0099] “Treatment” involves administration of the protective compositionafter the appearance of the disease. It will be understood that in humanand veterinary medicine, it is not always possible to distinguishbetween “preventing” and “suppressing” since the ultimate inductiveevent or events can be unknown, latent, or the patient is not determineduntil well after the occurrence of the event or events. Therefore, it iscommon to use the term “prophylaxis” as distinct from “treatment” toencompass both “preventing” and “suppressing” as defined herein. Theterm “protection,” as used herein, is meant to include “prophylaxis.”

[0100] Further, essentially all of the uses for which monoclonal orpolyclonal antibodies, or fragments thereof, have been envisioned by theprior art, can be addressed by the oligo-lysine tailed sFv proteins ofthe present invention. See, e.g., Kohler et al., Nature 256:495 (1975);Kohler et al, Eur. J Immunol. 6:511 (1976); Kohler et al., Eur. J.Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies andT-Cell Hybridomas, pp.563-681, Elsevier, N (1981); Sambrook et al.,Molecular Cloning—A Laboratory Manual, 2 nd ed., Cold Spring HarborLaboratory (1989).

[0101] The gene delivery system of the present invention can be used forany host. Preferably, the host will be a mammal. Preferred mammalsinclude primates such as humans and chimpanzees. domestic animals suchas horses, cows, pigs, dogs, and cats. More preferably, the host animalis a primate or domestic animal. Still more preferably, the host animalis a primate such as a human.

[0102] Because humans are the desired hosts for in vivo delivery,certain test models have been developed and accepted by the field todetermine the efficacy and utility of a delivery system. This involvesin vitro testing, ex vivo testing and use of marker genes. Thus, thesusceptibility of a cell to gene delivery by the method of the presentinvention can be determined by assays for a reporter gene. A marker genesuch as that encoding β-galactosidase (β-gal), chloramphenicol acetyltransferase (CAT), etc. is used for convenience to determine whether aprotein can be expressed in a particular recombinant construct deliveredby the present method. In addition, the quantity and duration ofexpression can be assayed. The use of, for example, neomycin resistanceto determine the efficacy of gene delivery has been described in humantesting with the desired gene. Thus, the skilled artisan, based on thisdisclosure can readily determine the efficacy of delivery of aparticular vector construct in a particular target tissue and host usingthe method of the present invention.

[0103] The genetic material (nucleic acids) that is delivered to thetarget cell using the method of the present invention can be genes, forexample, those that encode a variety of proteins including anticancerand antiviral agents. Such genes include those encoding varioushormones, growth factors, enzymes, cytokines, receptors, MHC moleculesand the like. The term “genes” includes nucleic acid sequences bothexogenous and endogenous to cells into which the vector containing thegene of interest can be introduced.

[0104] Of particular interest for use in gene delivery are those genesencoding polypeptides either absent, produced in diminished quantitiesor produced in a mutant form in individuals suffering from a geneticdisease. Such genetic diseases include retinoblastoma, Wilms tumor,adenosine deaminase deficiency (ADA), thalassemias, cystic fibrosis,Sickle cell disease, Huntington's disease, Duchenne's musculardystrophy, Phenylketonuria, Lesch-Nyhan syndrome, Gaucher's disease,Tay-Sach's disease, and the like.

[0105] Additionally, it is of interest to use genes encoding tumorsuppressor genes (e.g., retinoblastoma gene), TNF, TGF-β, TGF-α,hemoglobin, interleukins, GM-CSF, G-CSF, M-CSF, human growth hormone,co-stimulatory factor B7, insulin, factor VIII, factor IX, PDGF, EGF,NGF, EPO, β-globin and the like, as well as biologically active muteinsof these proteins. Genes for delivery to target cells can be from avariety of species; however, preferred species sources for genes ofinterest are those species into which the gene of interest is to beinserted using the method of the present invention.

[0106] The gene can further encode a product that regulates expressionof another gene product or blocks one or more steps in a biologicalpathway, such as the sepsis pathway. In addition, the gene can encode atoxin fused to a polypeptide, e.g., a receptor ligand or an antibodythat directs the toxin to a target such as a tumor cell or a virus.Similarly, the gene can encode a protein that provides a therapeuticeffect to a diseased tissue or organ.

[0107] Basic techniques for operably inserting genes into expressionvectors are known to those skilled in the art. See, Sambrook et al.,MOLECULAR CLONING: A LABORATORY MANUAL, 2 nd ed., Cold Spring HarborLaboratory, Cold Spring Harbor, N.Y. (1989); Ausubel et al. (eds.),CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley and Sons (1987), bothincorporated herein by reference.

[0108] Several possible vector systems are available for the expressionof the gene in the mammalian cell that has been transformed according tothe method of the present invention expression. These vector systems arewell known to those skilled in the art. For example, one class ofvectors utilize DNA elements which provide autonomously replicatingextra-chromosomal plasmids, derived from animal viruses such as bovinepapilloma virus, polyoma virus, adenovirus, or SV40 virus. A secondclass of vectors relies upon the integration of the desired genesequences into the host chromosome. Cells which have stably integratedthe introduced DNA into their chromosomes can be selected by alsointroducing one or more markers which allow selection of host cellswhich contain the expression vector. The marker can provide forprototrophy to an auxotrophic host, biocide resistance, e.g.,antibiotics, or resistance to heavy metals, such as copper or the like.The selectable marker gene can either be directly linked to the DNAsequences to be expressed, or introduced into the same cell byco-transformation. Additional elements can also be needed for optimalsynthesis of mRNA. These elements can include splice signals, as well astranscription promoters, enhancers, and termination signals. The cDNAexpression vectors incorporating such elements include those describedby Okayama, H., Mol. Cell. Biol. 3:280 (1983), and others.

[0109] Single Chain Polypeptides

[0110] The invention relates to the discovery that single-chainantigen-binding proteins (“SCA”) or single-chain variable fragments ofantibodies (“sFv”) having basic amino acid tails, have significantutility beyond that of the non basic amino acid tailed single-chainantigen-binding proteins. In addition to maintaining an antigen bindingsite, this SCA protein has a basic amino acid region, such asoligo-lysine or oligo arginine or a combination thereof, according tothe present invention as disclosed, supra, at the C-terminus orN-terminus which is capable of non-specific nucleic acid binding thusenabling the basic amino acid tailed SCA polypeptide to act as a carrierto deliver nucleic acid to cells. Accordingly, the invention is directedto monovalent and multivalent SCA proteins having an oligo-lysine tail,compositions of monovalent and multivalent basic amino acid tailed SCAproteins, methods of making and purifying monovalent and multivalentbasic amino acid tailed SCA proteins, and uses for the basic amino acidtailed SCA proteins. The invention is also directed to SCA proteinscontaining basic amino acid regions having a diagnostic or therapeuticagent bound to the basic amino acid linked polypeptide.

[0111] The terms “single-chain antigen-binding molecule” (SCA) or“single-chain Fv” (sFv) are used interchangeably. They are structurallydefined as comprising the binding portion of a first polypeptide fromthe variable region of an antibody V_(L) (or V_(H)), associated with thebinding portion of a second polypeptide from the variable region of anantibody V_(H) (or V_(L)), the two polypeptides being joined by apeptide linker linking the first and second polypeptides into a singlepolypeptide chain, such that the first polypeptide is N-terminal to thelinker and second polypeptide is C-terminal to the first polypeptide andlinker. The single polypeptide chain thus comprises a pair of variableregions connected by a polypeptide linker. The regions can associate toform a functional antigen-binding site, as in the case wherein theregions comprise a light-chain and a heavy-chain variable region pairwith appropriately paired complementarity determining regions (CDRs). Inthis case, the single-chain protein is referred to as a “single-chainantigen-binding protein” or “single-chain antigen-binding molecule.”

[0112] Single-chain Fvs can and have been constructed in several ways.Either V_(L) is the N-terminal domain followed by the linker and V_(H)(a V_(L) -Linker-V_(H) construction) or V_(H) is the N-terminal domainfollowed by the linker and V_(L) (V_(H) -Linker-V_(L) construction). Thepreferred embodiment contains V_(L) in the N-terminal domain (see,Anand, N.N., et al., J Biol. Chem. 266:21874-21879 (1991)).Alternatively, multiple linkers have also been used. Several types ofsFv proteins have been successfully constructed and purified, and haveshown binding affinities and specificities similar to the antibodiesfrom which they were derived.

[0113] A description of the theory and production of single-chainantigen-binding proteins is found in Ladner et al., U.S. Pat. Nos.4,946,778, 5,260,203, 5,455,030 and 5,518,889, and in Huston et al.,U.S. Pat. No. 5,091,513 (“biosynthetic antibody binding sites” (BABS)),all incorporated herein by reference. The single-chain antigen-bindingproteins produced under the process recited in the above patents havebinding specificity and affinity substantially similar to that of thecorresponding Fab fragment.

[0114] Typically, the Fv domains have been selected from the group ofmonoclonal antibodies known by their abbreviations in the literature as26-10, MOPC 315, 741F8, 520C9, McPC 603, D1.3, murine phOx, human phOx,RFL3.8 sTCR, 1A6, Se155-4,18-2-3,4-4-20, 7A4-1, B6.2, CC49,3C2,2 c,MA-15C5/K₁₂G₀, Ox, etc. (see, Huston, J. S. et al., Proc. Natl. Acad.Sci. USA 85:5879-5883 (1988); Huston, J. S. et al., SIM News 38(4)(Supp.):11 (1988); McCartney, J. et al., ICSU Short Reports 10:114(1990); McCartney, J. E. et al., unpublished results (1990); Nedelman,M. A. et al., J Nuclear Med. 32 (Supp.):1005 (1991); Huston, J. S. etal., In: Molecular Design and Modeling. Concepts and Applications, PartB, edited by J. J. Langone, Methods in Enzymology 203:46-88 (1991);Huston, J. S. et al., In: Advances in the Applications of MonoclonalAntibodies in Clinical Oncology, Epenetos, A. A. (Ed.), London, Chapman& Hall (1993); Bird, R. E. et al., Science 242:423-426 (1988); Bedzyk,W. D. et al., J Biol. Chem. 265:18615-18620 (1990); Colcher, D. et al.,J Nat. Cancer Inst. 82:1191-1197 (1990); Gibbs, R. A. et al., Proc.Natl. A cad. Sci. USA 88:4001-4004 (1991); Milenic, D. E. et al., CancerResearch 51:6363-6371 (1991); Pantoliano, M. W. et al., Biochemistry30:10117-10125 (1991); Chaudhary, V. K. et al., Nature 339:394-397(1989); Chaudhary, V. K. et al., Proc. Natl. Acad. Sci. USA 87:1066-1070(1990); Batra, J. K. et al., Biochem. Biophys. Res. Comm. 171:1-6(1990); Batra, J. K. et al., J Biol. Chem. 265:15198-15202 (1990);Chaudhary, V. K. et al., Proc. Natl. Acad. Sci. USA 87:9491-9494 (1990);Batra, J. K. et al., Mol. Cell. Biol. 11:2200-2205 (1991); Brinkmann, U.et al., Proc. Natl. Acad. Sci. USA 88:8616-8620 (1991); Seetharam, S. etal., J Biol. Chem. 266:17376-17381 (1991); Brinkmann, U. et al., Proc.Natl. Acad. Sci. USA 89:3075-3079 (1992); Glockshuber, R. et al.,Biochemistry 29:1362-1367 (1990); Skerra, A. et al., Bio/Technol.9:273-278 (1991); Pack, P. et al., Biochemistry 31:1579-1534 (1992);Clackson, T. et al., Nature 352:624-628 (1991); Marks, J. D. et al.,J.Mol. Biol. 222:581-597 (1991); Iverson, B. L. et al., Science249:659-662 (1990); Roberts, V. A. et al., Proc. Natl. Acad. Sci. USA87:6654-6658 (1990); Condra, J. H. et al., J Biol. Chem. 265:2292-2295(1990); Laroche, Y. et al., J Biol. Chem. 266:16343-16349 (1991);Holvoet, P. et al., J. Biol. Chem. 266:19717-19724 (1991); Anand, N. N.et al, J. Biol. Chem. 266:21874-21879 (1991); Fuchs, P. et al.,Bio/Technol. 9:1369-1372 (1991); Breitling, F. et al., Gene 104:104-153(1991); Seehaus, T. et al., Gene 114:235-237 (1992); Takkinen, K. etal., Protein Engng. 4:837-841 (1991); Dreher, M. L. et al., J. Immunol.Methods 139:197-205 (1991); Mottez, E. et al., Eur. J. Immunol.21:467-471 (1991); Traunecker, A. et al., Proc. Natl. Acad. Sci. USA88:8646-8650 (1991); Traunecker, A. et al., EMBO J. 10:3655-3659 (1991);Hoo, W. F. S. et al, Proc. Natl. Acad. Sci. USA 89:4759-4763 (1993)).

[0115] Linkers of the invention used to construct sFv polypeptides aredesigned to span the C-terminus of V_(L) (or neighboring site thereof)and the N-terminus of V_(H) (or neighboring site thereof). The preferredlength of the peptide linker should be from 2 to about 50 amino acids.In each particular case, the preferred length will depend upon thenature of the polypeptides to be linked and the desired activity of thelinked fusion polypeptide resulting from the linkage. Generally, thelinker should be long enough to allow the resulting linked fusionpolypeptide to properly fold into a conformation providing the desiredbiological activity. Where conformational information is available, asis the case with sFv polypeptides discussed below, the appropriatelinker length can be estimated by consideration of the 3-dimensionalconformation of the substituent polypeptides and the desiredconformation of the resulting linked fusion polypeptide. Where suchinformation is not available, the appropriate linker length can beempirically determined by testing a series of linked fusion polypeptideswith linkers of varying lengths for the desired biological activity.Such linkers are described in detail in WO 94/12520, incorporated hereinby reference.

[0116] Preferred linkers used to construct sFv polypeptides have between10 and 30 amino acid residues. The linkers are designed to be flexible,and it is recommended that an underlying sequence of alternating Gly andSer residues be used. To enhance the solubility of the linker and itsassociated single chain Fv protein, three charged residues can beincluded, two positively charged lysine residues (K) and one negativelycharged glutamic acid residue (E). Preferably, one of the lysineresidues is placed close to the N-terminus of V_(H), to replace thepositive charge lost when forming the peptide bond of the linker and theV_(H). Such linkers are described in detail in U.S. patent applicationSer. No. 08/224,591, filed Apr. 7, 1994, incorporated herein byreference. See also, Whitlow, M., et al., Protein Engng. 7:1017-1026(1994). It should also be noted that a basic amino acid region havinglysine and arginine residues could also be used in the linker for thesFv polypeptides of the present invention.

[0117] For multivalent sFvs, the association of two or more sFvs isrequired for their formation. Although, multivalent sFvs can be producedfrom sFvs with linkers as long as 25 residues, they tend to be unstable.Holliger, P., et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993),have recently demonstrated that linkers 0 to 15 residues in lengthfacilitate the formation of divalent Fvs. See, Whitlow, M., et al.,Protein Engng. 7:1017-1026 (1994); Hoogenboom, H. R., Nature Biotech.15:125-126(1997). Such multivalent sFvs are described in detail in WO93/11161, herein incorporated by reference.

[0118] Furthermore, single-chain and multivalent immunoeffectorantigen-binding fusion proteins have also been designed and constructed.Such single-chain and multivalent immunoeffector antigen-binding fusionproteins provide the binding capability of the antigen binding proteincombined with the immunoeffector or cytolytic function fusion partner,such as TNF, PLAP, IL-2, GM-CSF and the like. Such single-chain andmultivalent immunoeffector antigen-binding fusion proteins are describedin detail in U.S. Pat. No. 5,763,733, incorporated by reference.

[0119] The object of the present invention is to produce an sFv having anucleic acid binding region comprising basic amino acid residues. Thenucleic acid binding region can comprise a sequence that is rich inbasic amino acids such as lysine, arginine and combinations thereof.This region will contain enough basic amino acids to obtain therequisite overall positive charge on the sFv for nucleic acid binding.These nucleic acid binding regions can be at the C-terminal region,N-terminal region or both of the sFv. The nucleic acid binding regionscan be generated by mutating one or a plurality of amino acid residue(s)or by adding a block of basic amino acid residues to the C-terminalregion, N-terminal region or both of the sFv. The sFv can have a regionrich in basic amino acids, an oligo-Lys, oligo-Arg or combinationthereof as a tail such that the basic amino acid rich region, oligo-Lysor oligo-Arg residues are sufficient to bind nucleic acids and thepolypeptide binds an antigen (i.e., the polypeptide's ability to bind anantigen is not disrupted).

[0120] Preferably, the nucleic acid binding region of the single-chainantigen-binding polypeptide comprises at least 2 to 8 groups of eightconsecutive Lys residues, Arg residues or a combination thereof, whereineach group of eight consecutive lysine, arg or lys/arg residues isseparated from adjacent groups by 0-20 amino acid residues. Morepreferably, the nucleic acid binding region of the single-chainantigen-binding polypeptide comprises at least 2 to 6 groups of eightconsecutive Lys residues, Arg residues or a combination thereof, whereineach group of eight consecutive lysine, arg or lys/arg residues isseparated from adjacent groups by 0-20 amino acid residues. Still morepreferably, the nucleic acid binding region of the single-chainantigen-binding polypeptide comprises at least 2 to 4 groups of eightconsecutive Lys residues, Arg residues or a combination thereof, whereineach group of eight consecutive lysine, arg or lys/arg residues isseparated from adjacent groups by 0-20 amino acid residues. Morepreferably, the nucleic acid binding region of the single-chainantigen-binding polypeptide comprises at least 2 to 3 groups of eightconsecutive Lys residues, Arg residues or a combination thereof, whereineach group of eight consecutive lysine, arg or lys/arg residues isseparated from adjacent groups by 0-20 amino acid residues. Still morepreferably, the nucleic acid binding region of the single-chainantigen-binding polypeptide has at least 2 groups of eight consecutiveLys residues, Arg residues or a combination thereof, wherein each groupof eight consecutive lysine, arg or lys/arg residues is separated fromadjacent groups by 0-20 amino acid residues.

[0121] Alternatively, the nucleic acid binding regions of thesingle-chain antigen-binding polypeptide of the present invention can berepresented by the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK)(SEQ ID NO: 8), wherein K is lysine, m is an integer between 1 and 7 andn is an integer between 0 and 20; 2) (RRRRRRRR)m (X)n (RRRRRRRR) (SEQ IDNO: 9), wherein R is Arginine, m is an integer between 1 and 7 and n isan integer between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is an integer between 1 and 7and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK)(SEQ ID NO: 11), wherein K is lysine, R is arginine, m is an integerbetween 1 and 7 and n is an integer between 0 and 20. Preferably, theDNA binding regions of the single-chain antigen-binding polypeptide ofthe present invention can be represented by the following formulas: 1)(KKKKKKKK)m (X)n (KKKKKKKK) (SEQ ID NO: 8), wherein K is lysine, m is aninteger between 1 and 5 and n is an integer between 0 and 20; 2)(RRRRRRRR)m (X)n (RRRRRRRR) (SEQ ID NO: 9), wherein R is Arginine, m isan integer between 1 and 5 and n is an integer between 0 and 20; 3)(RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO: 10), wherein K is lysine R isarginine such that the K and R residues either alternate or are inrandom order, m is an integer between 1 and 5 and n is an integerbetween 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK) (SEQ ID NO: 11),wherein K is lysine, R is arginine, m is an integer between 1 and 5 andn is an integer between 0 and 20.

[0122] More preferably, the nucleic acid binding regions of thesingle-chain antigen-binding polypeptide of the present invention can berepresented by the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK)(SEQ ID NO: 8), wherein K is lysine, m is an integer between 1 and 3 andn is an integer between 0 and 20; 2) (RRRRRRRR)m (X)n (RRRRRRRR) (SEQ IDNO: 9), wherein R is Arginine, m is an integer between 1 and 3 and n isan integer between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is an integer between 1 and 3and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK)(SEQ ID NO: 11), wherein K is lysine, R is arginine, m is an integerbetween 1 and 3 and n is an integer between 0 and 20.

[0123] Still more preferably, the nucleic acid binding regions of thesingle-chain antigen-binding polypeptide of the present invention can berepresented by the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK)(SEQ ID NO: 8), wherein K is lysine, m is an integer between 1 and 2 andn is an integer between 0 and 20; 2) (RRRRRRRR)m (X)n (RRRRRRRR) (SEQ IDNO: 9), wherein R is Arginine, m is an integer between 1 and 2 and n isan integer between 0 and 20; 3) (RKRKRKRK)m (X)n (RKRKRKRK) (SEQ ID NO:10), wherein K is lysine R is arginine such that the K and R residueseither alternate or are in random order, m is an integer between 1 and 2and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n (KKKKKKKK)(SEQ ID NO: 11), wherein K is lysine, R is arginine, m is an integerbetween 1 and 2 and n is an integer between 0 and 20.

[0124] More preferably, the nucleic acid binding regions of thesingle-chain antigen-binding polypeptide of the present invention can berepresented by the following formulas: 1) (KKKKKKKK)m (X)n (KKKKKKKK)(SEQ ID NO: 8), wherein K is lysine, m is 1 and n is an integer between0 and 20; 2) (RRRRRRRR)m (X)n (RRRRRRRR) (SEQ ID NO: 9), wherein R isArginine, m is 1 and n is an integer between 0 and 20; 3) (RKRKRKRK)m(X)n (RKRKRKRK) (SEQ ID NO: 10), wherein K is lysine R is arginine suchthat the K and R residues either alternate or are in random order, m is1 and n is an integer between 0 and 20; and 4) (RRRRRRRR)m (X)n(KKKKKKKK) (SEQ ID NO: 11), wherein K is lysine, R is arginine, m is 1and n is an integer between 0 and 20. Even more preferably, thesingle-chain antigen-binding polypeptide has the region rich in basicamino acid residues, oligo lysine residues, oligo arginine residues orcombination thereof, configured such that the number of groups ofconsecutive Lys and or Arg residues is no higher than that which wouldresult in an unstable polynucleotide encoding the single-chainantigen-binding polypeptide or significantly reduce the efficiency oftranslation of the polynucleotide encoding the single-chainantigen-binding polypeptide.

[0125] These novel sFv proteins can be conjugated to activatedpolyethylene glycol (PEG) such that the PEG modification occurspreferentially at specifically engineered sites. See, U.S. applicationSer. No. 09/069,842, filed on Apr. 30,1998.

[0126] A further object of the invention is to produce monovalent andmultivalent sFvs having the oligo lysine tails of the present invention.For multivalent sFv, the association of two or more sFvs is required fortheir formation. For example, multivalent sFvs can be generated bychemically crosslinking two sFvs with C-terminal cysteine residues(Cumber et al., J Immunol. 149:120-126 (1992)) and by linking two sFvswith a third polypeptide linker to form a dimeric Fv (George et al., J.Cell. Biochem. 15E:127 (1991)). Details for producing multivalent sFvsby aggregation are described in Whitlow, M., et al., Protein Engng.7:1017-1026 (1994). Multivalent antigen-binding fusion proteins of theinvention can be made by any process, but preferably according to theprocess for making multivalent antigen-binding proteins set forth in WO93/11161, incorporated herein by reference.

[0127] Synthesis of the Minimal Nucleic Acid Binding Regions

[0128] In the present invention, a region rich in basic amino acidresidues, oligo-Lys, oligo-Arg or oligo-Lys/Arg nucleic acid bindingregion can occur in the C-terminus or N-terminus of the sFv polypeptide.Preferably, the nucleic acid binding region will occur in the C-terminusof the sFv polypeptide. The site at the C-terminus was chosen to be asfar from the antigen binding residues of the polypeptide as possible soas to prevent disruption of the antigen-binding site.

[0129] Site-directed mutagenesis is used to change the native proteinsequence of the single-chain antigen-binding protein to one thatincorporates the regions rich in Lys, Arg, oligo-Lys, oligo-Arg oroligo-Lys/Arg residues. The mutant protein gene is placed in anexpression system, such as bacterial cells, yeast or other fungal cells,insect cells or mammalian cells. The mutant protein can be purified bystandard purification methods.

[0130] Oligonucleotide-directed mutagenesis methods for generating theminimal basic amino acid nucleic acid binding regions or the presentinvention and related techniques for mutagenesis of cloned DNA are wellknown in the art. See, Sambrook et al, MOLECULAR CLONING: A LABORATORYMANUAL, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.(1989); Ausubel et al. (eds.), CURRENT PROTOCOLS IN MOLECULAR BIOLOGY,John Wiley and Sons (1987), both incorporated herein by reference. Apreferred oligonucleotide-directed mutagenesis method for the presentinvention is according to Ho et al., Gene 77:51-59 (1989), incorporatedherein by reference.

[0131] Hosts and Vectors for the Preparation of the sFv Polypeptides

[0132] After mutating the nucleotide sequence of the sFv, the mutatedDNA can be inserted into a cloning vector for further analysis, such asfor confirmation of the DNA sequence. To express the polypeptide encodedby the mutated DNA sequence, the DNA sequence is operably linked toregulatory sequences controlling transcriptional expression andintroduced into either a prokaryotic or eukaryotic host cell.

[0133] Although sFvs are typically produced by prokaryotic host cells,eukaryotic cells can also be used as host cells. Preferred host cellsinclude E. coli, yeast or other fungal cells, insect cells or mammaliancells. Standard protein purification methods can be used to purify thesemutant proteins. Only minor modification to the native protein'spurification scheme can be required.

[0134] Also provided by the invention are DNA molecules such as purifiedgenetic sequences or plasmids or vectors encoding the sFv of theinvention that have engineered region(s) containing high content ofbasic amino acids, oligo-Lys, oligo-Arg or oligo-Lys/Arg residuescapable of non-specific nucleic acid binding. The DNA sequence for thesFv polypeptide can be chosen so as to optimize production in organismssuch as prokaryotes, yeast or other fungal cells, insect cells ormammalian cells.

[0135] The DNA molecule encoding an sFv having a region rich in basicamino acid residues, oligo-Lys, oligo-Arg, or oligo-Lys/Arg residueswhich comprise the minimal DNA binding region can be operably linkedinto an expression vector and introduced into a host cell to enable theexpression of the engineered sFv protein by that cell. A DNA sequenceencoding an sFv having a region rich in basic amino acid residues,oligo-Lys, oligo-Arg, or oligo-Lys/Arg regions can be recombined withvector DNA in accordance with conventional techniques. Recombinant hostsas well as methods of using them to produce single chain proteins of theinvention are also provided herein.

[0136] The expression of such sFv proteins of the invention can beaccomplished in procaryotic cells. Preferred prokaryotic hosts include,but are not limited to, bacteria such as Bacilli, Streptomyces and E.coli.

[0137] Eukaryotic hosts for cloning and expression of such sFv proteinsof the invention include plant cells, insect cells, yeast, fungi, andmammalian cells (such as, for example, human or primate cells) either invivo, or in tissue culture. A preferred host for the invention is Pichiapastoris. As discussed in more detail below, the inventors havedemonstrated excellent yields of the sFv proteins having the regionsrich in basic amino acid residues according to the present inventionusing Pichia pastoris.

[0138] The appropriate DNA molecules, hosts, methods of production,isolation and purification of monovalent, multivalent and fusion formsof proteins, especially sFv polypeptides, are thoroughly described inthe prior art, such as, e.g., U.S. Pat. No. 4,946,778, which is fullyincorporated herein by reference.

[0139] The sFv encoding sequence having the minimal DNA binding regioncomprising oligo-Lys, oligo-Arg, or oligo-Lys/Arg residues and anoperably linked promoter can be introduced into a recipient prokaryoticor eukaryotic cell either as a non-replicating DNA (or RNA) molecule,which can either be a linear molecule or, more preferably, a closedcovalent circular molecule. Since such molecules are incapable ofautonomous replication, the expression of the desired sFv protein canoccur through the transient expression of the introduced sequence.Alternatively, permanent expression can occur through the integration ofthe introduced sFv sequence into the host chromosome.

[0140] In one embodiment, the sFv sequence can be integrated into thehost cell chromosome. Cells which have stably integrated the introducedDNA into their chromosomes can be selected by also introducing one ormore markers which allow for selection of host cells which contain thesFv sequence and marker. The marker can complement an auxotrophy in thehost (such as his4, leu2, or ura3, which are common yeast auxotrophicmarkers), or can confer biocide resistance, e.g., antibiotics, orresistance to heavy metals, such as copper, or the like. The selectablemarker gene can either be directly linked to the sFv DNA sequence to beexpressed, or introduced into the same cell by co-transfection.

[0141] In another embodiment, the introduced sequence will beincorporated into a plasmid vector capable of autonomous replication inthe recipient host cell. Any of a wide variety of vectors can beemployed for this purpose. Factors of importance in selecting aparticular plasmid or viral vector include: the ease with whichrecipient cells that contain the vector can be recognized and selectedfrom those recipient cells which do not contain the vector; the numberof copies of the vector which are desired in a particular host; andwhether it is desirable to be able to “shuttle” the vector between hostcells of different species.

[0142] Any of a series of yeast vector systems can be utilized. Examplesof such expression vectors include the yeast 2-micron circle, theexpression plasmids YEP13, YCP and YRP, etc., or their derivatives. Suchplasmids are well known in the art (Botstein et al., Miami Wntr. Symp.19:265-274 (1982); Broach, J. R., In: The Molecular Biology of the YeastSaccharomyces: Life Cycle and Inheritance, Cold Spring HarborLaboratory, Cold Spring Harbor, N.Y., p. 445- 470 (1981); Broach, J. R.,Cell 28:203-204 (1982)).

[0143] For a mammalian host, several possible vector systems areavailable for expression. One class of vectors utilize DNA elementswhich provide autonomously replicating extra-chromosomal plasmids,derived from animal viruses such as bovine papilloma virus, polyomavirus, adenovirus, or SV40 virus. A second class of vectors relies uponthe integration of the desired gene sequences into the host chromosome.Cells which have stably integrated the introduced DNA into theirchromosomes can be selected by also introducing one or more markerswhich allow selection of host cells which contain the expression vector.The marker can provide prototrophy to an auxotrophic host, biocideresistance, e.g., antibiotics, or resistance to heavy metals, such ascopper or the like. The selectable marker gene can either be directlylinked to the DNA sequences to be expressed, or introduced into the samecell by co-transformation. Additional elements can also be needed foroptimal synthesis of mRNA. These elements can include splice signals, aswell as transcription promoters, enhancers, and termination signals. ThecDNA expression vectors incorporating such elements include thosedescribed by Okayama, H., Mol. Cell. Biol. 3:280 (1983), and others.

[0144] Among vectors preferred for use in bacteria are pQE70, pQE60 andpQE-9, available from Qiagen; pBS vectors, Phagescript vectors,Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available fromStratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 availablefrom Pharmacia. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT,pOG44, pXT1 and pSG available from Stratagene; and pSVK3, pBPV, pMSG andpSVL available from Pharmacia. Preferred vectors for expression inPichia are pHIL-S1 (Invitrogen Corp.) and pIC9 (Invitrogen Corp.). Othersuitable vectors will be readily apparent to the skilled artisan.

[0145] Once the vector or DNA sequence containing the sFv constructs ofthe present invention has been prepared for expression, the DNAconstructs can be introduced or transformed into an appropriate host.Various techniques can be employed, such as transformation,transfection, protoplast fusion, calcium phosphate precipitation,electroporation, or other conventional techniques. After the cells havebeen transformed with the recombinant DNA (or RNA) molecule, the cellsare grown in media and screened for appropriate activities. Expressionof the sequence results in the production of the mutant sFv for use inthe gene delivery method of the present invention.

[0146] Expression and Purification of sFv Proteins

[0147] The inventors have demonstrated excellent yields of theoligo-lysine tailed sFv of the present invention, particularly CC49-16K,secreted from Pichia postoris. This provides a means of making enough ofthe DNA-binding sFv for commercial gene therapy applications. Inaddition, the inventors have discovered a novel purification procedurefor the oligo-lysine tailed sFv which is ionically bound to the nucleicacids from lysed cells in the fermentation broth. The oligo-lysinetailed sFv is initially found in the cell pellet fraction, but can bereadily released by salt treatment in excellent yield and purity.

[0148] The vectors pIC9 and pHIL-S1, and host strain GS115 were obtainedfrom Invitrogen Corporation and all cloning and expression work wasperformed as described in the “Pichia Expression Kit Instruction,Manual” supplied by Invitrogen. Clone number designations for CC49/218sFv variants are as follows. Clone # Plasmid # Vector C-terminal lysine# EN266(5) pEN262(5) pHIL-S1  8 K EN266(7) pEN262(7) pHIL-S1 16 KEN281(1) pEN278(1) pIC9  8 K EN282 pEN278(5) pIC9 16 K

[0149] Expression levels were >20 mg/L of protein as estimated bySDS-PAGE and Western analysis. The 8K (8 lysine tail) version and 16K(16 lysine tail) version of the sFv migrated on SDS-PAGE at positionsapproximately 1.6 KD and 3.3 KD greater in mass in agreement with thepredicted size from their polypeptide sequences. The sFv proteins wereall soluble in shake-flask experiments, but often were associated withthe cell pellet in fermentation cultures. The sFv was dissociated fromthe pellet by high salt wash (1.5 M NaCl, 20 mM Tris-HCl, pH 8.0 at roomtemperature for 2 hours). Consequently, this provided a very goodpurification step. Fermentation cultures contain substantial amounts oflysed cells and the 16K sFv variant proteins appear to bind to nucleicacids present in the fermentation medium. Significantly, the native sFvdoes not become cell associated. The Coomassie Blue stained SDS-PAGE gelof FIG. 5, is an example of the excellent expression of CC4916K 266(7)and the ability of salt treatment to solubilize and purify the sFv ofthe present invention.

[0150] Western analysis confirmed the major sFv molecules at about 26.5Kd for native sFv and about 30 Kd for CC49-16K sFv. This experiment wasperformed as follows: (1) 100 ml of expression medium of EN266(7) fromshake-flask culture was frozen and thawed, then centrifuged at 3,000rpm, room temperature (RT) for 30 min.; (2) EN266 cell pellet wasresuspended in 2 ml of 1.5 M NaCl, 20 mM Tris-HCI, pH 8.0, RT, 2 hrs.;(3) the sample was centrifuged as in (1) and the supernatant wasdialyzed against 0.15 M NaCl, 10 mM Tris-HCl, pH 8.0 at 4° C. overnight;(4) the protein content of the supernantant was quantitated at A280 tobe about 1.5 mg/ml; and (5) 30 μl of the supernatant were loaded onSDS-PAGE gels for Coomassie Blue staining and Western analysis.

[0151] The Western analysis was performed as follows: Immunoblottingprocedures for transfer of proteins from gels to nitrocellulosemembranes by the semi-dry method were performed as described in Harlow,E., & Lane, D., Antibodies: A Laboratory Manual, Cold Spring HarborLaboratory Press, Cold Spring Harbor, N.Y., (1988). Blot development wasalso performed according to the procedures in this manual. Briefly, theblotted membranes were blocked in 1% BSA blocking reagent in PBS at roomtemperature for 2 hr; washed 3× with PBS; and incubated with 3% BSA inPBS with a 1:1,000 dilution of rabbit anti-CC49/218 SCA antibody at 4°C. overnight. Next, a 3% BSA in PBS solution containing a 1:1000dilution of horseradish peroxidase conjugated goat anti-rabbit IgG wasused in a 1 hr incubation at room temperature. After washing with PBS,the membranes were developed with TMBM-500 (MOSS, Inc.) at roomtemperature for 1 min.

[0152] The purified sFvs of the present invention can be stored as astabilized protein composition having increased frozen storage stabilityas described in detail in U.S. Pat. No. 5,656,730, incorporated hereinby reference.

[0153] Administration

[0154] Administration of basic amino acid tailed sFv-nucleic acidconjugates of the invention for ex vivo and in vivo delivery of nucleicacids to mammalian cells will be by analogous methods to sFv where thediagnostic or therapeutic principle is directly linked to the sFv or aloaded carrier is linked by random binding to amine or carboxyl groupson amino acid residues of the sFv in a non-site-specific manner.

[0155] Conjugates of the present invention (immunoconjugates) can beformulated according to known methods to prepare pharmaceutically usefulcompositions, such as by admixture with a pharmaceutically acceptablecarrier vehicle. Suitable vehicles and their formulation are described,for example, in Remington's Pharmaceutical Sciences, 18th ed., Osol, A.,ed., Mack, Easton Pa. (1990). In order to form a pharmaceuticallyacceptable composition suitable for effective administration, suchcompositions will contain a therapeutically effective amount of theimmunoconjugate, either alone, or with a suitable amount of carriervehicle.

[0156] The immunoconjugate can be provided to a patient by means wellknown in the art. Such means of introduction include subcutaneous means,intramuscular means, intravenous means, intra-arterial means, orparenteral means. Intravenous, intraarterial or intrapleuraladministration is normally used for lung, breast, and leukemic tumors.Intraperitoneal administration is advised for ovarian tumors.Intrathecal administration is advised for brain tumors and leukemia.Subcutaneous administration is advised for Hodgkin's disease, lymphomaand breast carcinoma. Catheter perfusion is useful for metastatic lung,breast or germ cell carcinomas of the liver. Intralesionaladministration is useful for lung and breast lesions.

[0157] For therapeutic or diagnostic applications, compositionsaccording to the invention can be administered parenterally incombination with conventional injectable liquid carriers such as sterilepyrogen-free water, sterile peroxide-free ethyl oleate, dehydratedalcohol, or propylene glycol. Conventional pharmaceutical adjuvants forinjection solution such as stabilizing agent, solubilizing agents andbuffers, such as ethanol, complex forming agents such as ethylenediamine tetraacetic acid, tartrate and citrate buffers, andhigh-molecular weight polymers such as polyethylene oxide for viscosityregulation can be added. Such compositions can be injectedintramuscularly, intraperitoneally, or intravenously.

[0158] Further non-limiting examples of carriers and diluents includealbumin and/or other plasma protein components such as low densitylipoproteins, high density lipoproteins and the lipids with which theseserum proteins are associated. These lipids include phosphatidylcholine, phosphatidyl serine, phosphatidyl ethanolamine and neutrallipids such as triglycerides. Lipid carriers also include, withoutlimitation, tocopherol.

[0159] A typical regimen for preventing, suppressing, or treatingvarious pathologies comprises administration of an effective amount ofan sFv conjugate, administered over a period of one or several days, upto and including between one week and about 24 months.

[0160] It is understood that the dosage of the present inventionadministered in vivo or in vitro will be dependent upon the age, sex,health, and weight of the recipient, kind of concurrent treatment, ifany, frequency of treatment, and the nature of the effect desired. Theranges of effective doses provided below are not intended to limit theinvention and represent preferred dose ranges. However, the mostpreferred dosage will be tailored to the individual subject, as isunderstood and determinable by one of skill in the art, without undueexperimentation. See, e.g., Berkow et al., eds., Merck Manual, 16thedition, Merck and Co., Rahway, N.J. (1992); Goodman et al., eds.,Goodman and Gilman's The Pharmacological Basis of Therapeutics, 8thedition, Pergamon Press, Inc., Elmsford, N.Y. (1990); Avery's DrugTreatment: Principles and Practice of Clinical Pharmacology andTherapeutics, 3rd edition, ADIS Press, LTD., Williams and Wilkins,Baltimore, Md. (1987), Ebadi, Pharmacology, Little, Brown and Co.,Boston (1985), Katzung, Basic and Clinical Phamacology, Appleton andLange, Norwalk, Conn. (1992), which references and references citedtherein, are entirely incorporated herein by reference.

[0161] The total dose required for each treatment can be administered bymultiple doses or in a single dose. Effective amounts of adiagnostic/pharmaceutical compound or composition of the presentinvention are from about 0.001 μg to about 100 mg/kg body weight,administered at intervals of 4-72 hours, for a period of 2 hours to 5years, or any range or value therein, such as 0.01-1.0, 1.0-10,10-50 and50-100 mg/kg, at intervals of 1-4,6-12,12-24 and 24-72 hours, for aperiod of 0.5, 1.0-2.0, 2.0-4.0 and 4.0-7.0 days, or 1, 1-2, 2-4, 4-52or more weeks, or 1, 2, 3-10, 10-20, 20-60 or more years, or any rangeor value therein.

[0162] Preparations for parenteral administration include sterileaqueous or non-aqueous solutions, suspensions, and emulsions, which cancontain auxiliary agents or excipients which are known in the art. See,e.g., Berker, supra, Goodman, supra, Avery, supra and Ebadi, supra,which are entirely incorporated herein by reference, including allreferences cited therein.

[0163] Pharmaceutical compositions comprising at least one type of sFvconjugate having a basic amino acid rich region according to theinvention, or, 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 types of sFv conjugates,of the present invention can be contained in an amount effective toachieve its intended purpose. In addition to at least one sFv conjugate,a pharmaceutical composition can contain suitable pharmaceuticallyacceptable carriers, such as excipients, carriers and/or auxiliarieswhich facilitate processing of the active compounds into preparationswhich can be used pharmaceutically.

[0164] Pharmaceutical compositions can also include suitable solutionsfor administration intravenously, subcutaneously, dermally, orally,mucosally or rectally, and contain from about 0.01 to 99 percent,preferably from about 20 to 75 percent of active component (i.e., theDNA binding sFv conjugate) together with the excipient. Pharmaceuticalcompositions for oral administration include tablets and capsules.Additional lipid and lipoprotein drug delivery systems that can beincluded herein are described more fully in Annals N.Y. Acad. Sci.507:775-88, 98-103, and 252-271, which disclosure is hereby incorporatedby reference.

[0165] For example, the sFvs of the present invention can be prepared asa pharmaceutically acceptable, single-chain antigen-binding proteincomposition having increased frozen-storage stability, as described indetail in U.S. Pat. No 5,656,730, incorporated by reference.

[0166] Having now generally described this invention, the same will bebetter understood by reference to certain specific examples, which areincluded for the purpose of illustration and not intended to be limitingunless otherwise specified

EXAMPLES Example 1 Demonstration of DNA Binding

[0167] In order to demonstrate the DNA binding capabilities of theC-terminal oligo-lysine tailed SCAs the following experiment wasperformed. The CC49/218 SCA and A33/218 SCA having a 16 lysine (“16K”)C-terminal tail and the CC49/218 SCA and A33/218 SCA having a 8 lysine(“8K”) C-terminal tail were expressed from Pichia.

[0168] Genetic construction of the sFv proteins having an oligo-lysineC-terminal tails was performed by first introducing a unique BstEIIrestriction site (GGTNACC) into VH codons including positions 108, 109and 110 (Kabat numbers) by standard site directed mutagenesis. Thismutation does not alter the encoded amino acids and is accomplished bysimply changing the position 108 codon from TCA (Ser) to TCG (Ser), asingle base change. The unique BstEII restriction site can be digestedwith the restriction enzyme BstEII and a synthetic linker having BstEIIcompatible overhangs is ligated into the site. The synthetic linker usedconsists of two complementary oligonucleotides 5′ GTC ACC GTC TCC AAAAAG AAG AAA AAA AAG AAA AAG 3′ (SEQ ID NO: 12); and 5′ GT GAC CTT TTTCTT TTT TTT CTT CTT TTT GAA GAC G 3′ (SEQ ID NO: 13).

[0169] This linker can be inserted as a single copy or as two or moretandem copies due to the compatible overhangs. In the case of the 16lysine tail sFv, two tandem copes of this linker are presented asconfirmation by DNA sequencing of the genetic construction.

[0170] The proteins were assayed for DNA binding function in a standardGel Shift assay (Mistry et al. Biotechniques 22:718-729 (1997)). TheA33/218 SCA having a 16 lysine C-terminal tail (EN266 (3F)) wasincubated with plasmid pFLAG-1(International Biotechnologies, Inc.) (0.5μg) in DNA binding buffer (0.01M Tris, pH8.0, 0.15M NaCl). The sampleswere then electrophoresed on the gel shown in FIG. 4. The results showthat this SCA protein had DNA binding capability. The CC49/218 SCAhaving a 16 lysine C-terminal tail (EN278(5)) also bound DNA. Theresults in FIG. 4 show that supercoiled DNA species (faster movingspecies) was more effectively complexed by the SCA molecules than thenicked linear DNA species and are consistent with the results shown byMistry et al. The CC49/218 SCA and A33/218 SCA having a 8 lysineC-terminal tail, however, did not show DNA binding capacity by thisassay.

Example 2 Transfection of Mammalian Cells

[0171] In order to demonstrate the transfection of mammalian cells usingthe oligo-lysine single-chain antigen binding polypeptide of the presentinvention the following experiment can be performed.

[0172] CC49/218 sFv protein engineered to contain a 16 lysine tail(FIGS. 2A and B) is expressed and secreted by Pichia pastoris strainEN266. The protein is purified by standard cation and anion exchangechromatography well known to those skilled in the art. The protein isthen concentrated by diafiltration. The sFv is incubated with reporterplasmid DNA, such as one of the pRL vectors (Promega Corp.). The sFv andplasmid DNA are incubated for 10-60 minutes in buffer (0.01M Tris, pH8.0, 0.15M NaCl) in the following concentration ratio: 0.5 μg of pRLvector and 10 μg of sFv polypeptide. Controls for the transfectionexperiment are 1) CC49/218 sFv lacking the 16 lysine tail incubated withpRL vector and 2) plasmid alone.

[0173] The sFv/plasmid complex is then incubated with cultured LS-174Tcells which are resuspended in the buffer (0.01 M Tris, pH 8.0, 0.15 MNaCl), for 10-60 minutes. The cells are then centrifuged at 2,000 rpmfor five minutes and washed once with incubation buffer. The cells arenext suspended in electroporation buffer (1xHBS: 20 mM HEPES, pH 7.05,137 mM NaCl, 5 mM KCl, 0.7 mM Na₂HPO₄, 6 mM dextrose). One set of cellsare subjected to electroporation using BTX Electro Cell Manipulator 600System according to the manufacturer's instructions. Another cell ofcells are examined for the spontaneous uptake of the sFv/plasmid complexby omitting the electroporation step. The success of transfection of thecells by the reporter plasmid is quantitated by luciferase assaysperformed according to the protocol described by Promega Corp., inPromega Notes #57.

Example 3 Demonstration of DNA Binding

[0174] Additional Gel Shift assays demonstrating the DNA bindingcapacity of CC49-16K (EN266(7)) and A33-16K were performed as follows.Experimental conditions are as in Example 1 except as follows.

[0175] CC49-16K (EN266(7)) was purified by DEAE column chromatographyand fraction 8 (OD280=3.6) was dialyzed versus 0.15 M NaCl, 10 mMTris-HCl, pH 8.0 at 4° C. overnight. Aliquots of the samples (0-90 μl )were mixed with 1 μl of plasmid Bluescript SK⁻ ⁽3 ug/μl) and distilledwater was added to a final volume of 100 μl. The samples were incubatedat RT for 1 hr. Twenty μl of each sample were loaded and run on a 1.2%agarose gel, 100V, 2 hrs.

[0176] The A33-16K sample was incubated as in Example 1 except theplasmid used was pFLAG from IBI, Inc. (1 μg/μl) and incubation was doneat RT for 1 hr.

Example 4 ELISA Assay

[0177] An ELISA assay demonstrating retention of mucin-binding activityof the CC49-16K sFv EN266(7) shown in FIG. 6 was performed as follows.The ELISA was performed by 1:2 serial dilutions of the sFv samples.Immunoassay procedures were performed using modifications of protocolsfrom Harlow, E., & Lane, D., Antibodies: A Laboratory Manual, ColdSpring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1988). Directbinding assays were performed and a dose response curve was constructed.Bovine submaxillary mucin (250 ng per 100 μl well) antigen was used tocoat microtiter plate wells (MaxiSorp, Nunc, VWR Scientific, Boston,Mass.). The EN266(7) or purified CC49/218 SCA proteins were dilutedserially in PBS containing 1% BSA and incubated in the coated wells at22° C. for 1 hr. After the plate was washed three times with PBScontaining 0.05% Tween 20 (PBS-T), the bound SCA was detected by a 1 hrincubation with a secondary antibody (rabbit anti-CC49/218, 1:2000dilution, 37° C.), followed by three PBS-T washes, and a 1 hr incubationat 37° C. with horseradish peroxidase conjugated goat anti-rabbit IgGantibody (1:2000 dilution). Plates were washed 3 times with PBS-T andwere read at 540 nm following addition of 100 μl of3′,3′,5′,5′-tetramethylbenzidine (TMB).

[0178] EN234 is native CC49/218 sFv produced from Pichia. GX9251 isnative CC49/218 sFv produced from E. coli. The BSA control is notplotted on the graph. These results indicate that antigen-bindingactivity of the sFv is not substantially altered in the sFv-16K variantprotein.

[0179] In the next experiments, these oligo-lysine tailed sFv proteinswere shown to be capable of gene delivery in vitro. First, these resultsdemonstrate that CC49-16K sFv which is complexed with plasmid DNAadheres to TAG-72 antigen bearing LS174-T cells. Second, these resultsdemonstrate that the CC49-16K sFv/plasmid complex can markedly enhancethe LipofectAmine transfection of DNA into the LS174-T cells.

Example 5 Targeted DNA Delivery

[0180] Binding of CC49-16K sFv/plasmid DNA complex to LS174-T cells wasdemonstrated by in situ immunochemistry.

[0181] Experimental Protocol: LS174-T cells were grown in T75 tissueculture flasks in MEM medium containing 1× non-essential amino acids, 1×Earles' salts and 1% fetal bovine serum at 37° C. with 5% CO₂. Cells(2×10⁶ cells) in a T75 flask were treated with 1× trypsin/EDTA and splitinto four T75 flasks and incubated 16 hrs. The experimental procedure isas follows. (1)8×10⁶ LS174-T cells were collected by trypsin/EDTAdigestion from the 4 T75 flasks. (2) The cells were centrifuged at 3,000rpm at 4° C. (3) The supernatant was discarded and the cells wereresuspended in PBS at 4° C. (4) The cells were centrifuged at 3,000 rpm,the supernatant discarded, and 2 ml of 1% paraformaldehyde (PFA) in PBSwas added to the pellet on ice for 30 min. (5) The cells were washedwith PBS, twice at 4° C. (6) The CC49-16K sFv and GX9251 samples wereanalyzed and quantitated by SDS-PAGE. (7) 5 μl (0.1 μg) of DIG-labeled(digoxigenin-labeled) pBR328 plasmid DNA (Boehringer Mannheim Cat. No.1585 738) was mixed with 200 μl of native GX9251 CC49/218 sFv (15 μg/ml)or with 200 μl of EN266(7) CC49-16K sFv (approx. 5 μg/ml) at RT for30-min. (8) The LS174-T cells were added to the DIG-labeled pBR328plasmid/sFv mixture and incubated at RT for 30 min. (9) The cells werewashed twice with PBS. (10) The cells were centrifuged and resuspendedin 200 μl of PBS containing 1% BSA and a 1:100 dilution ofanti-digoxigenin-AP (alkaline phosphatase) Fab (Boehringer Mannheim Cat.No. 1093 274). Incubation was done at RT for 30 min. (11). The cellswere washed twice with PBS. (12). The cells were centrifuged and thepellet was resuspended in 100 μl of Fast Red solution (one tablet ofFast Red was dissolved in 500 μl of 0.1M Tris-HCl, 0.15 M NaCl, pH 8.3;Fast Red Tablets are obtained from Boehringer Mannheim, Cat. No. 1 496549). Incubation was at RT for 30 min. (13). 50 μl of each sample werepipetted onto a glass slide, covered with a cover slip, and observedimmediately under a microscope (Nikon) using a 20× object lense andphotographed.

[0182] GX9251 CC49/218 sFv sample was used in the complex with thepBR328 plasmid DNA. Background staining was minimal. EN266(7) CC49-16KsFv sample was used in the complex with the plasmid DNA. Positive redstaining was visually intense. Staining was more apparent in regions ofcell debris presumably due to the nature of the cell surface TAG-72antigen which is repetitive and easily shed making it more denselyconcentrated in these regions. Since the detection signal results fromthe presence of the DIG-labeled pBR328 plasmid DNA, this experimentdemonstrated (1) that the CC49-16K sFv can target plasmid to LS174-Tcells but native CC49/218 sFv can not do so and (2) that the affinity ofthe plasmid for the CC49-16K sFv proteins is sufficient to remaincomplexed through several washing steps.

Example 6 Cell Transfection

[0183] This example demonstrates the transfection of LS174-T cells byreporter plasmid pSEAP2 using CC49-16K sFv as carrier.

[0184] Protocol: The SEAP Reporter plasmid system (PT3057-2) wasobtained from Clontech (Palo Alto, Calif.) and used according to thesupplier's instructions. The pSEAP2 plasmid expresses a gene encoding asecreted alkaline phosphatase which serves as a reporter for successfultransfection of a cultured cell. The LIPOFECTAMINE PLUS reagent whichenhances transfection of DNA was obtained from Life Technologies(Gaithersburg, Md. Cat. No. 10964-013) and used according to thesupplier's instructions. As initial controls, DNA binding of theCC49-16K to pSEAP2 was demonstrated by Gel Shift experiments asdescribed in Examples 1 and 3 and a suitable sFv to plasmid ratio wasdetermined as stated below. Plasmid pSEAP2 was also shown to besuccessfully transfected into LS174-T cells by the Lipofectamine methodusing the recommended protocol. Furthermore, the AP reporterchemiluminescence signal could be quantitated by exposure to an X-rayfilm such that the strength of the signals (grains on the film) wereproportional to the amount of plasmid added over a 0-5 μg range.

[0185] The demonstration of CC49-16K sFv targeted transfection of LS174-T by plasmid pSEAP2 employed the following protocol. All testarticles were done in duplicate. (1) LS174-T cells (3×10⁶) were platedon each well of a six well (Costar) plate in DMGM medium with 10% fetalbovine serum (FBS), at 37° C. with 5% CO₂ overnight. (2) The cells werewashed with HBSS. (3) In separate microfuge tubes, (a) 5 μl (5 μg) ofplasmid pSEAP2 and 50 μl (about 50 μg) of EN266(7) CC49-16K sFv; OR (b)5 μl (5 μg) of plasmid pSEAP2 and 50 μl (about 50 μg) of EN234 CC49/218native sFv; OR (c) 5 μl (5 μg) of plasmid pSEAP and 50 μl of water weremixed and incubated at room temperature for 30 min in 200 μl of DMEMmedium. (4) The sFv/plasmid mixtures were added onto the LS174-T cellsand incubated at 37° C. for 60 min. (5) The cells were washed twice withHBSS. (6) 12 μl of PLUS reagent (Life Technologies Lipofectamine Pluskit) were mixed with 100 μl of DMEM medium and added onto the LS174-Tcells, then incubated at 37° C. for 30 min. (7) 8 μl of Lipofectaminewas mixed into 100 μl of DMEM and added to each well of the LS174-Tplate, then incubated at 37° C. for 30 min. (8) 0.8 ml of DMEM was addedto each well and incubated at 37° C. for 3 hrs. (9) 100 μl of FBS and 1ml of DMGM with 10% FBS were added. Incubation continued at 37° C. with5% CO₂ for 2 days. (10) 1 ml of culture medium was transferred from eachwell into 1.5 ml microfuge tubes. (11) The cells were centrifuged in amicrofuge to pellet the cells and debris. (12) The supernatants fromeach tube were transferred into a Centricon 10 (Amicon Inc.) andconcentrated to a volume of 0.1 ml. (13) 25 μl of each sample were mixedwith 75 μl of 1× dilution buffer (Clontech) and incubated at 65° C. for30 min. (14) The samples were cooled to room temperature and 100 μl ofassay buffer (Clontech) were added with incubation at room temperaturefor 5 min. (15) 100 μl of 1.25 mM CSPD with 1× chemiluminescenceenhancer (Clontech) were mixed into each tube. (16) 150 μl of eachsample were transferred into individual wells of a DYNATECH microFLUORplate. (17) The microtiter plate was overlayed with X-ray film and thefilm was exposed for 3 hrs at room temperature.

[0186] The results are shown in FIGS. 7A and 7B. Lanes in the exposedx-ray film with duplicate lanes top and bottom are as follows. Lanes: 1.Positive control with 0.5 μl of pure placental alkaline phosphatase inoverexposed well; 2. Standard (Lipofectamine Plus) transfection ofpSEAP2 without sFv (i.e., condition c above) where washing steps (5)above are omitted; 3. LS174-T cell control with no added plasmid or sFv;4. pSEAP plasmid transfection without sFv (condition c above); 5. pSEAP2plasmid plus EN234 native sFv as described for condition b above; 6.pSEAP2 plasmid plus EN266(7) CC49-16K sFv as described for condition aabove; 7. Same as lane 6 except both protocol steps 5 (washings) and 7(Lipofectamine) are omitted; 8. Control with DMEM medium alone. FIGS. 7Aand 7B show the area quantitations of the x-ray film which werepreformed by densitometry scanning using a Molecular Dynamics PD-SIlaser scanner. Quantitation data are provided for both top (FIG. 7A;lane 8 was not scanned) and bottom (FIG. 7B; lanes 1 and 8 were notscanned) rows. Note that CC49-16K sFv (lane 6) promotes transfectionabout 8-fold over plasmid-alone control levels (lane 4) in thisexperiment.

[0187] Summary of transfection experiments: The area quantitationresults demonstrate that plasmid pSEAP can not efficiently transfect thecells in the absence of CC49-16K sFv (lane 4). The plasmid is simplywashed off the cells in step 5. However, CC49-16K inclusion (lane 6)allows the sFv/plasmid complexes to remain attached to the cells andtransfection proceeds as efficiently as in a standard transfectionprotocol (lane 2) where the washing steps are omitted. Lane 5 shows thatnative sFv has a minor but detectable enhancement of transfection. Thismay be due to nonspecific association of this very basic (pI ˜9.3) sFvwith the negatively charged DNA. Lane 7 suggests that theCC49-16K/plasmid complex with no Lipofectamine added is slightly betterthan standard (+lipofectamine) transfection (with no washing at step 5).

Example 7 Synthesis of DNA Binding Regions in Other sFvs

[0188] Oligonucleotide-directed mutagenesis, synthetic linker ligationor polymerase chain reaction can be employed to create oligo-lysine oroligo-arginine C-terminal tail in an sFv having a Kabat consensusV_(K)I/218 /V_(H)III sFv (FIG. 8), C6.5/218 sFv (FIG. 9), and A33/218sFv. Amino acid assignments of the Kabat consensus V_(K)I/218/V_(H)IIIsFv and A33/218 sFv are according to Kabat et al., Sequences of Proteinsof lmmunological Interest, pp. 108 & 331, 5th ed., U.S. Dept. Health andHuman Services, Bethesda, Md. (1991), where the assigned amino acidresidue at a position is the most commonly occurring amino acid at thatposition. Amino acid assignments of the wild-type C6.5 variable domainsare according to Schier, R., et al., J. Mol. Biol. 255:28-43 (1996).

[0189] The mutated sFvs are individually ligated into the Pichiatransfer plasmid pHIL-S1 or pIC9 (Invitrogen Corp.) and transformed intoPichia pastoris. Detailed protocols for these procedures are presentedin the Pichia Expression Kit Instruction Manual Cat. No. X1710-01 (1994)from Invitrogen Corporation. The sFv variants are placed behind a yeastsignal sequence in these constructions and the integrated sFv in theyeast transformants are tested for secretion of the sFv proteins.Evaluation of expression is done by Coomassie staining of SDS-PAGE gels.

[0190] Although the foregoing refers to particular preferredembodiments, it will be understood that the present invention is not solimited. It will occur to those skilled in the art that variousmodifications can be made to the disclosed embodiments and that suchmodifications are intended to be within the scope of the presentinvention.

[0191] All documents, e.g., scientific publications, patents and patentpublications recited herein are hereby incorporated by reference intheir entirety to the same extent as if each individual document wasspecifically and individually indicated to be incorporated by referencein its entirety. Where the document cited only provides the first pageof the document, the entire document is intended, including theremaining pages of the document.

1 13 1 782 DNA Artificial Sequence Description of Artificial SequenceCC49/218 sFv 1 gac gtc gtg atg tca cag tct cca tcc tcc cta cct gtg tcagtt ggc 48 Asp Val Val Met Ser Gln Ser Pro Ser Ser Leu Pro Val Ser ValGly 1 5 10 15 gag aag gtt act ttg agc tgc aag tcc agt cag agc ctt ttatat agt 96 Glu Lys Val Thr Leu Ser Cys Lys Ser Ser Gln Ser Leu Leu TyrSer 20 25 30 ggt aat caa aag aac tac ttg gcc tgg tac cag cag aaa cca gggcag 144 Gly Asn Gln Lys Asn Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln35 40 45 tct cct aaa ctg ctg att tac tgg gca tcc gct agg gaa tct ggg gtc192 Ser Pro Lys Leu Leu Ile Tyr Trp Ala Ser Ala Arg Glu Ser Gly Val 5055 60 cct gat cgc ttc aca ggc agt gga tct ggg aca gat ttc act ctc tcc240 Pro Asp Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Ser 6570 75 80 atc agc agt gtg aag act gaa gac ctg gca gtt tat tac tgt cag cag288 Ile Ser Ser Val Lys Thr Glu Asp Leu Ala Val Tyr Tyr Cys Gln Gln 8590 95 tat tat agc tat ccc ctc acg ttc ggt gct ggg acc aag ctt gtg ctg336 Tyr Tyr Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Val Leu 100105 110 aaa ggc tct act tcc ggt agc ggc aaa ccc ggg agt ggt gaa ggt agc384 Lys Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser 115120 125 act aaa ggt cag gtt cag ctg cag cag tct gac gct gag ttg gtg aaa432 Thr Lys Gly Gln Val Gln Leu Gln Gln Ser Asp Ala Glu Leu Val Lys 130135 140 cct ggg gct tca gtg aag att tcc tgc aag gct tct ggc tac acc ttc480 Pro Gly Ala Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Thr Phe 145150 155 160 act gac cat gca att cac tgg gtg aaa cag aac cct gaa cag ggcctg 528 Thr Asp His Ala Ile His Trp Val Lys Gln Asn Pro Glu Gln Gly Leu165 170 175 gaa tgg att gga tat ttt tct ccc gga aat gat gat ttt aaa tacaat 576 Glu Trp Ile Gly Tyr Phe Ser Pro Gly Asn Asp Asp Phe Lys Tyr Asn180 185 190 gag agg ttc aag ggc aag gcc aca ctg act gca gac aaa tcc tccagc 624 Glu Arg Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser195 200 205 act gcc tac gtg cag ctc aac agc ctg aca tct gag gat tct gcagtg 672 Thr Ala Tyr Val Gln Leu Asn Ser Leu Thr Ser Glu Asp Ser Ala Val210 215 220 tat ttc tgt aca aga tcc ctg aat atg gcc tac tgg ggt caa ggaacc 720 Tyr Phe Cys Thr Arg Ser Leu Asn Met Ala Tyr Trp Gly Gln Gly Thr225 230 235 240 tcg gtc acc gtc tcc aaa aag aag aaa aaa aag aaa aag gtcacc gtc 768 Ser Val Thr Val Ser Lys Lys Lys Lys Lys Lys Lys Lys Val ThrVal 245 250 255 tcc taataggatc c 782 Ser 2 257 PRT Artificial SequenceDescription of Artificial Sequence CC49/218 sFv 2 Asp Val Val Met SerGln Ser Pro Ser Ser Leu Pro Val Ser Val Gly 1 5 10 15 Glu Lys Val ThrLeu Ser Cys Lys Ser Ser Gln Ser Leu Leu Tyr Ser 20 25 30 Gly Asn Gln LysAsn Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln 35 40 45 Ser Pro Lys LeuLeu Ile Tyr Trp Ala Ser Ala Arg Glu Ser Gly Val 50 55 60 Pro Asp Arg PheThr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Ser 65 70 75 80 Ile Ser SerVal Lys Thr Glu Asp Leu Ala Val Tyr Tyr Cys Gln Gln 85 90 95 Tyr Tyr SerTyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Val Leu 100 105 110 Lys GlySer Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser 115 120 125 ThrLys Gly Gln Val Gln Leu Gln Gln Ser Asp Ala Glu Leu Val Lys 130 135 140Pro Gly Ala Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Thr Phe 145 150155 160 Thr Asp His Ala Ile His Trp Val Lys Gln Asn Pro Glu Gln Gly Leu165 170 175 Glu Trp Ile Gly Tyr Phe Ser Pro Gly Asn Asp Asp Phe Lys TyrAsn 180 185 190 Glu Arg Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys SerSer Ser 195 200 205 Thr Ala Tyr Val Gln Leu Asn Ser Leu Thr Ser Glu AspSer Ala Val 210 215 220 Tyr Phe Cys Thr Arg Ser Leu Asn Met Ala Tyr TrpGly Gln Gly Thr 225 230 235 240 Ser Val Thr Val Ser Lys Lys Lys Lys LysLys Lys Lys Val Thr Val 245 250 255 Ser 3 818 DNA Artificial SequenceDescription of Artificial Sequence CC49/218 sFv 3 gac gtc gtg atg tcacag tct cca tcc tcc cta cct gtg tca gtt ggc 48 Asp Val Val Met Ser GlnSer Pro Ser Ser Leu Pro Val Ser Val Gly 1 5 10 15 gag aag gtt act ttgagc tgc aag tcc agt cag agc ctt tta tat agt 96 Glu Lys Val Thr Leu SerCys Lys Ser Ser Gln Ser Leu Leu Tyr Ser 20 25 30 ggt aat caa aag aac tacttg gcc tgg tac cag cag aaa cca ggg cag 144 Gly Asn Gln Lys Asn Tyr LeuAla Trp Tyr Gln Gln Lys Pro Gly Gln 35 40 45 tct cct aaa ctg ctg att tactgg gca tcc gct agg gaa tct ggg gtc 192 Ser Pro Lys Leu Leu Ile Tyr TrpAla Ser Ala Arg Glu Ser Gly Val 50 55 60 cct gat cgc ttc aca ggc agt ggatct ggg aca gat ttc act ctc tcc 240 Pro Asp Arg Phe Thr Gly Ser Gly SerGly Thr Asp Phe Thr Leu Ser 65 70 75 80 atc agc agt gtg aag act gaa gacctg gca gtt tat tac tgt cag cag 288 Ile Ser Ser Val Lys Thr Glu Asp LeuAla Val Tyr Tyr Cys Gln Gln 85 90 95 tat tat agc tat ccc ctc acg ttc ggtgct ggg acc aag ctt gtg ctg 336 Tyr Tyr Ser Tyr Pro Leu Thr Phe Gly AlaGly Thr Lys Leu Val Leu 100 105 110 aaa ggc tct act tcc ggt agc ggc aaaccc ggg agt ggt gaa ggt agc 384 Lys Gly Ser Thr Ser Gly Ser Gly Lys ProGly Ser Gly Glu Gly Ser 115 120 125 act aaa ggt cag gtt cag ctg cag cagtct gac gct gag ttg gtg aaa 432 Thr Lys Gly Gln Val Gln Leu Gln Gln SerAsp Ala Glu Leu Val Lys 130 135 140 cct ggg gct tca gtg aag att tcc tgcaag gct tct ggc tac acc ttc 480 Pro Gly Ala Ser Val Lys Ile Ser Cys LysAla Ser Gly Tyr Thr Phe 145 150 155 160 act gac cat gca att cac tgg gtgaaa cag aac cct gaa cag ggc ctg 528 Thr Asp His Ala Ile His Trp Val LysGln Asn Pro Glu Gln Gly Leu 165 170 175 gaa tgg att gga tat ttt tct cccgga aat gat gat ttt aaa tac aat 576 Glu Trp Ile Gly Tyr Phe Ser Pro GlyAsn Asp Asp Phe Lys Tyr Asn 180 185 190 gag agg ttc aag ggc aag gcc acactg act gca gac aaa tcc tcc agc 624 Glu Arg Phe Lys Gly Lys Ala Thr LeuThr Ala Asp Lys Ser Ser Ser 195 200 205 act gcc tac gtg cag ctc aac agcctg aca tct gag gat tct gca gtg 672 Thr Ala Tyr Val Gln Leu Asn Ser LeuThr Ser Glu Asp Ser Ala Val 210 215 220 tat ttc tgt aca aga tcc ctg aatatg gcc tac tgg ggt caa gga acc 720 Tyr Phe Cys Thr Arg Ser Leu Asn MetAla Tyr Trp Gly Gln Gly Thr 225 230 235 240 tcg gtc acc gtc tcc aaa aagaag aaa aaa aag aaa aag gtc acc gtc 768 Ser Val Thr Val Ser Lys Lys LysLys Lys Lys Lys Lys Val Thr Val 245 250 255 tcc aaa aag aag aaa aaa aagaaa aag gtc acc gtc tcc taataggatc c 818 Ser Lys Lys Lys Lys Lys Lys LysLys Val Thr Val Ser 260 265 4 269 PRT Artificial Sequence Description ofArtificial Sequence CC49/218 sFv 4 Asp Val Val Met Ser Gln Ser Pro SerSer Leu Pro Val Ser Val Gly 1 5 10 15 Glu Lys Val Thr Leu Ser Cys LysSer Ser Gln Ser Leu Leu Tyr Ser 20 25 30 Gly Asn Gln Lys Asn Tyr Leu AlaTrp Tyr Gln Gln Lys Pro Gly Gln 35 40 45 Ser Pro Lys Leu Leu Ile Tyr TrpAla Ser Ala Arg Glu Ser Gly Val 50 55 60 Pro Asp Arg Phe Thr Gly Ser GlySer Gly Thr Asp Phe Thr Leu Ser 65 70 75 80 Ile Ser Ser Val Lys Thr GluAsp Leu Ala Val Tyr Tyr Cys Gln Gln 85 90 95 Tyr Tyr Ser Tyr Pro Leu ThrPhe Gly Ala Gly Thr Lys Leu Val Leu 100 105 110 Lys Gly Ser Thr Ser GlySer Gly Lys Pro Gly Ser Gly Glu Gly Ser 115 120 125 Thr Lys Gly Gln ValGln Leu Gln Gln Ser Asp Ala Glu Leu Val Lys 130 135 140 Pro Gly Ala SerVal Lys Ile Ser Cys Lys Ala Ser Gly Tyr Thr Phe 145 150 155 160 Thr AspHis Ala Ile His Trp Val Lys Gln Asn Pro Glu Gln Gly Leu 165 170 175 GluTrp Ile Gly Tyr Phe Ser Pro Gly Asn Asp Asp Phe Lys Tyr Asn 180 185 190Glu Arg Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 195 200205 Thr Ala Tyr Val Gln Leu Asn Ser Leu Thr Ser Glu Asp Ser Ala Val 210215 220 Tyr Phe Cys Thr Arg Ser Leu Asn Met Ala Tyr Trp Gly Gln Gly Thr225 230 235 240 Ser Val Thr Val Ser Lys Lys Lys Lys Lys Lys Lys Lys ValThr Val 245 250 255 Ser Lys Lys Lys Lys Lys Lys Lys Lys Val Thr Val Ser260 265 5 265 PRT Artificial Sequence Description of Artificial SequenceA33/218 sFv 5 Asp Val Val Met Thr Gln Ser Gln Lys Phe Met Ser Thr SerVal Gly 1 5 10 15 Asp Arg Val Ser Ile Thr Cys Lys Ala Ser Gln Asn ValArg Thr Val 20 25 30 Val Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ser Pro LysThr Leu Ile 35 40 45 Tyr Leu Ala Ser Asn Arg His Thr Gly Val Pro Asp ArgPhe Thr Gly 50 55 60 Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser AsnVal Gln Ser 65 70 75 80 Glu Asp Leu Ala Asp Tyr Phe Cys Leu Gln His TrpSer Tyr Pro Leu 85 90 95 Thr Phe Gly Ser Gly Thr Lys Leu Glu Val Lys GlySer Thr Ser Gly 100 105 110 Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser ThrLys Gly Glu Val Lys 115 120 125 Leu Val Glu Ser Gly Gly Gly Leu Val LysPro Gly Gly Ser Leu Lys 130 135 140 Leu Ser Cys Ala Ala Ser Gly Phe AlaPhe Ser Thr Tyr Asp Met Ser 145 150 155 160 Trp Val Arg Gln Thr Pro GluLys Arg Leu Glu Trp Val Ala Thr Ile 165 170 175 Ser Ser Gly Gly Ser TyrThr Tyr Tyr Leu Asp Ser Val Lys Gly Arg 180 185 190 Phe Thr Ile Ser ArgAsp Ser Ala Arg Asn Thr Leu Tyr Leu Gln Met 195 200 205 Ser Ser Leu ArgSer Glu Asp Thr Ala Leu Tyr Tyr Cys Ala Pro Thr 210 215 220 Thr Val ValPro Phe Ala Tyr Trp Gly Gln Gly Thr Leu Val Thr Val 225 230 235 240 SerLys Lys Lys Lys Lys Lys Lys Lys Val Thr Val Ser Lys Lys Lys 245 250 255Lys Lys Lys Lys Lys Val Thr Val Ser 260 265 6 283 PRT ArtificialSequence Description of Artificial Sequence Kabat Consensus 6 Asp IleGln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 1 5 10 15 AspArg Val Thr Ile Thr Cys Arg Ala Ser Gln Ser Leu Val Ser Ile 20 25 30 SerAsn Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys 35 40 45 LeuLeu Ile Tyr Ala Ala Ser Ser Leu Glu Ser Gly Val Pro Ser Arg 50 55 60 PheSer Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser 65 70 75 80Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Tyr Asn Ser 85 90 95Leu Pro Glu Trp Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Gly 100 105110 Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr Lys 115120 125 Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly130 135 140 Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe SerSer 145 150 155 160 Tyr Ala Met Ser Trp Val Arg Gln Ala Pro Gly Lys GlyLeu Glu Trp 165 170 175 Val Ser Val Ile Ser Gly Lys Thr Asp Gly Gly SerThr Tyr Tyr Ala 180 185 190 Asp Ser Val Lys Gly Arg Phe Thr Ile Ser ArgAsp Asn Ser Lys Asn 195 200 205 Thr Leu Tyr Leu Gln Met Asn Ser Leu ArgAla Glu Asp Thr Ala Val 210 215 220 Tyr Tyr Cys Ala Arg Gly Arg Xaa GlyXaa Ser Leu Ser Gly Xaa Tyr 225 230 235 240 Tyr Tyr Tyr His Tyr Phe AspTyr Trp Gly Gln Gly Thr Leu Val Thr 245 250 255 Val Ser Ser Lys Lys LysLys Lys Lys Lys Lys Val Thr Val Ser Lys 260 265 270 Lys Lys Lys Lys LysLys Lys Val Thr Val Ser 275 280 7 282 PRT Artificial SequenceDescription of Artificial Sequence C6.5/218 sFv 7 Gln Ser Val Leu ThrGln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln 1 5 10 15 Lys Val Thr IleSer Cys Ser Gly Ser Ser Ser Asn Ile Gly Asn Asn 20 25 30 Tyr Val Ser TrpTyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Leu 35 40 45 Ile Tyr Gly HisThr Asn Arg Pro Ala Gly Val Pro Asp Arg Phe Ser 50 55 60 Gly Ser Lys SerGly Thr Ser Ala Ser Leu Ala Ile Ser Gly Phe Arg 65 70 75 80 Ser Glu AspGlu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu 85 90 95 Ser Gly TrpVal Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly Gly 100 105 110 Ser ThrSer Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr Lys 115 120 125 GlyGln Val Gln Leu Leu Gln Ser Gly Ala Glu Leu Lys Lys Pro Gly 130 135 140Glu Ser Leu Lys Ile Ser Cys Lys Gly Ser Gly Tyr Ser Phe Thr Ser 145 150155 160 Tyr Trp Ile Ala Trp Val Arg Gln Met Pro Gly Lys Gly Leu Glu Tyr165 170 175 Met Gly Leu Ile Tyr Pro Gly Asp Ser Asp Thr Lys Tyr Ser ProSer 180 185 190 Phe Gln Gly Gln Val Thr Ile Ser Val Asp Lys Ser Val SerThr Ala 195 200 205 Tyr Leu Gln Trp Ser Ser Leu Lys Pro Ser Asp Ser AlaVal Tyr Phe 210 215 220 Cys Ala Arg His Asp Val Gly Tyr Cys Ser Ser SerAsn Cys Ala Lys 225 230 235 240 Trp Pro Glu Tyr Phe Gln His Trp Gly GlnGly Thr Leu Val Thr Val 245 250 255 Ser Ser Lys Lys Lys Lys Lys Lys LysLys Val Thr Val Ser Lys Lys 260 265 270 Lys Lys Lys Lys Lys Lys Val ThrVal Ser 275 280 8 84 PRT Artificial Sequence Description of ArtificialSequence Nucleic acid binding region 8 Lys Lys Lys Lys Lys Lys Lys LysLys Lys Lys Lys Lys Lys Lys Lys 1 5 10 15 Lys Lys Lys Lys Lys Lys LysLys Lys Lys Lys Lys Lys Lys Lys Lys 20 25 30 Lys Lys Lys Lys Lys Lys LysLys Lys Lys Lys Lys Lys Lys Lys Lys 35 40 45 Lys Lys Lys Lys Lys Lys LysLys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 50 55 60 Xaa Xaa Xaa Xaa Xaa Xaa XaaXaa Xaa Xaa Xaa Xaa Lys Lys Lys Lys 65 70 75 80 Lys Lys Lys Lys 9 84 PRTArtificial Sequence Description of Artificial Sequence Nucleic acidbinding region 9 Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg ArgArg Arg 1 5 10 15 Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg ArgArg Arg Arg 20 25 30 Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg ArgArg Arg Arg 35 40 45 Arg Arg Arg Arg Arg Arg Arg Arg Xaa Xaa Xaa Xaa XaaXaa Xaa Xaa 50 55 60 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa ArgArg Arg Arg 65 70 75 80 Arg Arg Arg Arg 10 83 PRT Artificial SequenceDescription of Artificial Sequence Nucleic acid binding region 10 ArgLys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys 1 5 10 15Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys 20 25 30Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys Arg Lys 35 40 45Arg Lys Arg Lys Arg Lys Arg Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 50 55 60Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Lys Arg Lys 65 70 7580 Arg Lys Arg 11 84 PRT Artificial Sequence Description of ArtificialSequence Nucleic acid binding region 11 Arg Arg Arg Arg Arg Arg Arg ArgArg Arg Arg Arg Arg Arg Arg Arg 1 5 10 15 Arg Arg Arg Arg Arg Arg ArgArg Arg Arg Arg Arg Arg Arg Arg Arg 20 25 30 Arg Arg Arg Arg Arg Arg ArgArg Arg Arg Arg Arg Arg Arg Arg Arg 35 40 45 Arg Arg Arg Arg Arg Arg ArgArg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 50 55 60 Xaa Xaa Xaa Xaa Xaa Xaa XaaXaa Xaa Xaa Xaa Xaa Lys Lys Lys Lys 65 70 75 80 Lys Lys Lys Lys 12 36DNA Artificial Sequence Description of Artificial SequenceOligonucleotide 12 gtcaccgtct ccaaaaagaa gaaaaaaaag aaaaag 36 13 36 DNAArtificial Sequence Description of Artificial Sequence Oligonucleotide13 gtgacctttt tctttttttt cttctttttg aagacg 36

What is claimed is:
 1. A method of delivering nucleic acids to a cellcomprising: (1) providing a basic amino acid tailed single-chainantigen-binding polypeptide capable of delivering nucleic acids to acell comprising: (a) a first polypeptide comprising the antigen bindingportion of the variable region of an antibody heavy or light chain; (b)a second polypeptide comprising the antigen binding portion of thevariable region of an antibody heavy or light chain; and (c) a peptidelinker linking the first and second polypeptides (a) and (b) into asingle chain polypeptide having an antigen binding site, wherein at itsC-terminus, N-terminus, or both of polypeptide (a), (b) or both, thesingle-chain antigen-binding polypeptide has an amount of basic aminoacid residues sufficient to bind nucleic acids, wherein the basic aminoacid residues are selected from the group consisting of: Lys, Arg and acombination thereof, and wherein the basic amino acid residues bindsnucleic acid and wherein the single-chain antigen-binding polypeptidebinds antigen; (2) allowing a nucleic acid to bind to the basic aminotailed single-chain antigen-binding polypeptide; and (3) transforming acell with the nucleic acid bound basic amino acid tailed single-chainantigen-binding polypeptide.
 2. The method of claim 1 wherein said firstpolypeptide (a) of said single-chain antigen-binding polypeptidecomprises the antigen binding portion of the variable region of anantibody light chain and said second polypeptide (b) comprises theantigen binding portion of the variable region of an antibody heavychain.
 3. The method of claim 1 wherein said cell is a mammalian cell.4. The method of claim 1, wherein the amount of basic amino acidresidues of the single-chain antigen-binding polypeptide comprise atleast 2 to 8 groups of eight consecutive lysine residues, wherein eachgroup of eight consecutive lysine residues is separated from theadjacent group by 0-20 amino acid residues.
 5. The method of claim 1,wherein the amount of basic amino acid residues comprise at least 2 to 8groups of eight consecutive arginine residues, wherein each group ofeight consecutive arginine residues is separated from the adjacent groupby 0-20 amino acid residues.
 6. The method of claim 1, wherein theamount of basic amino acid residues comprise at least 2 to 8 groups ofeight consecutive residues consisting of lysine and arginine residues,wherein each group of eight consecutive lysine and arginine residues isseparated from the adjacent group by 0-20 amino acid residues.